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(57) Abstract: The onset and progression of chronic autoimmune diseases, including human rheumatoid arthritis (RA) are likely 
determined by differential expression of genes that influence inflammatory and immune responses. The collagen-induced arthritis 
(CIA) mouse model for RA exhibits many of the same genetic and immunological features of RA; however, the profiles of gene 
expression during the inflammatory and immune responses of CIA or RA have not been well characterized. Previous studies have 
demonstrated that mRNA levels, particularly that of cytokines, can change over the course of CIA. To determine the contribution of 
various genes in the pathogenesis of CIA, microarray technology was used to simultaneously monitor 8,734 taiget cDNAs to discover 
arthritic stage-specific genes. The resulting gene expression profile identified 333 genes that were at least 2-fold up-regulated in all 
synovial samples: normal, acute disease and chronic disease. In addition, 385 disease-specific genes were identified that were greater 
than or equal to 2-fold over- or under-expressed in the disease state as compared to normal synovium. Clustering analysis among 
the arthritic states allowed for the identification of four distinct kinetic expression patterns based on differential expression levels in 
normal, acute disease and chronic disease synovial samples. 
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METHOD FOR DIAGNOSIS AND TREATMENT OF RHEUMATOID ARTHRITIS 

Government Interest in the Invention 
[00011 Certain aspects of the invention disclosed herein were made with United States 
government support under National Institutes of Health grants AD4958, AR44059, AR47712, and 
AR42632. The United States government has certain rights in these aspects of the invention. 
Tncor poration-B v-Reference Of CD-ROM Data 
[0002] Applicants hereby incorporate by reference in their entirety two copies of a 
compact disc, labeled "Copy 1" and "Copy 2," respectively, containing tablel.l.txt, 2,276,363 size 
in bytes, created on October 31, 2002; tablel.2.txt, 1,335,492 size in bytes, created on October 31, 
2002; tablel.3.txt, 2,924,772 size in bytes, created on October 31, 2002; table2.1.txt, 817,381 size 
in bytes, created on October 31, 2002; table2.2.txt, 1,003,344 size in bytes, created on October 31, 
2002; and table2.3.txt, 604,772 size in bytes, created on October 31, 2002. 

Pj^pkgTPUT^d of thg Invention 

Field ofthglnvCTtion 

[0003] The invention relates generally to materials and methods for diagnosis and 
treatment of rheumatoid arthritis (RA) and related conditions. More specifically, Ihe invention 
relates to nucleic acids, proteins, arrays thereof, methods for diagnosis and methods for analyzing 
the severity of RA and related conditions using, for example, patterns of up- and down-regulation 
of specific genes identified by microarray technology. The invention further relates to the treatment 
of RA by activating those genes or proteins that are down-regulated and/or inhibiting those genes or 
proteins that are up-regulated. The invention also relates to identifying and using targets for drug 
treatment, methods of screening candidate drugs, and methods for identifying optimal treatment 
approaches for a specific patient 
Description of the Related Art 

[0004] Collagen-induced arthritis (CIA) in mice has been utilized to study underlymg 
mechanisms of autoimmune arthritis because of its clinical, histologic, immunologic and genetic 
similarity to rheumatoid arthritis (RA). Although several immunoregulatory genes have been 
implicated in this model system, molecular mechanisms underlying the pathophysiology have only 
been partially defined. 

[0005] In CIA, progression of disease is associated with changes in the cell types 
infiltrating the joint. The acute phase of the disease is characterized by a predominantly 
neutrophilic infiltrate, with monocytes and lymphocytes constituting approximately 5% of the 
inflammatory cell population. By day 49, a decrease in lymphocytes is observed, with an increase 
in fibroblast/macrophage type cells and an increasingly fibrotic appearance. In conjunction with the 
changes of cellular infiltrate, mRNA and protein expression levels of several cytokines and 
chemokines also change over the course of disease. For example, TNFa protein expression in the 
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joint precedes that of IL-lp and IFN-y is expressed shortly after disease onset, but not late in 
disease. DL-lp and IL-IO niRNAs, but not those of IFN-y and IL-5, are detected in late disease. 

[0006] Classical approaches to studying inflammatory mediators in arthritis have 
focused on identifying and analyzing these mediators individually. While this method has proven 
extremely productive, arthritis represents a complex and multifactorial pathophysiology that likely 
involves hundreds or thousands of individual gene products acting in concert, hnproved 
understanding of the genes that are operative during the development of the inflammatory lesion 
may aid in the design of disease-specific therapies. Several methods to examine coordinated gene 
expression have been developed, including Northern blot, ribonuclease protection assay (RPA), • 
differential display and sequencing of cDNA libraries and expressed sequence tags (ESTs), Using 
total paw RNA from a mouse with CIA and using the method of RPA, the inventors have 
previously demonstrated distinct changes in mRNA expression of a number of cytokines in early 
and late CIA. IL-2, IL-6, MIP2 and IL-lp were found predominantly in early disease, whereas, 
TGFP was found predominantly in late disease. IL-1 1, IL-lra, MlPla, RANTES, TNFa and TNPP 
were present both in early and late disease. These changes in gene expression within the joint likely 
affect the disease pathology observed at the cellular and macroscopic level. Whether a similar 
temporal change in cellular infiltrate and mRNA expression profiles also occurs in RA is not clear,, 
as few synovial biopsies have been performed at the very early stages of RA. However, since most 
of the previously mentioned cytokines are found in synovial fluids and chronic RA synovium, these 
findings have relevance to RA. 

[0007] The recent advent of high-throughput methods, such as serial analysis of gene 
expression (SAGE) and DNA microarrays, have allowed large-scale, genome-wide 
characterizations of gene expression to be performed. Whole-genome expression profiling 
represents a major advance in genome-wide functional analysis. In a single assay, the 
transcriptional response of each gene to a change in cellular state,' including a disease or a chemical 
perturbation, can be measured. These changes in gene expression can reflect changes in mRNA 
levels or changes in the cells (proliferation or infiltration) that synthesize these mRNAs. DNA 
microaxray technology is well-suited for analyzing chronic diseases, such as autoimmune arthritis, 
because of flie wide spectrum of genes and endogenous mediators involved. A recent report 
describing the analysis of RA and inflammatory bowel disease tissues used a microarray of about 
100 genes known to have a role in inflammation. IL-6 and several matrix metalloproteinases were 
markedly upregulated in RA tissues; however the observed upregulation of matrix metallo-elastase 
(HME) was unexpected, since its expression was previously thougjit to be limited to alveolar 
macrophages and placental cells. Analyses such as these are able to identify genes, both known and 
novel, and discover their coordinately regulated expression during the disease process. 



-2- 



wo 03/072827 PCT/US02/35433 

[0008] Analysis of global gene expression in disease joints is likely to lead to a fuller 
understanding of the inflammatory processes responsible for arthritis. In the present study, DNA 
microarray technology was used to identify novel genes and biological pathways involved in CIA 
and to test the hypothesis that the previously observed set of stage-specific differentially activated 
genes in CIA represents a larger transcriptional profile. 

r>^^m ma^ Qf the I nvention 
[0009] Using microarray analysis, the expression of 8734 cDNAs was analyzed during 
various stages of mouse collagen induced arthritis (CIA), an animal model of RA. From the results, 
a method for the diagnosis and treatment of RA was developed. 

[0010] Embodiments relate to methods for the diagnosis and analysis of autoimmune 
disease or arthritide , in a patient. The methods can include, for example, obtaining a patient 
sample containing mRNA; analyzing gene expression using the mRNA that results in a gene 
expression signature of that mRNA, wherein the gene expression signature includes the 
identification and quantitation of gene expression from genes that have been identified as being 
differentially expressed in RA; and using that gene expression signature to diagnose or analyze the 
autoimmune disease or arthritide in said patient, wherein said gene expression of at least about 60% 
of said genes correlates with that of said gene signature. 

[0011] The autoimmune disease or arthritides can be, for example, Rheumatoid 
Arthritis, Lupus, Ankylosing Spondylitis, fibrositis, fibromyalgia, osteoarthritis. Gout, Juvenile 
Rheumatoid Arthritis, an autoimmune disease caused by an infectious agent, and the like. 
Preferably, the autoimmune disease or arthritide can be rheumatoid arthritis. The patient can be, for 
example, a human, a primate, a dog, a cat, a horse, a sheep, and the like. 

[00121 The analysis can be, for example, an analysis of severity of the disease, an 
analysis of pain manifestation, an analysis of deformity, an analysis of treatment mefliods, an 
analysis of treatment efficacy, and the like. 

[0013] The gene expression analysis can involve at least about 10 genes that are 
identified as differentially expressed in arthritis, preferably at least about 50 genes that are 
identified as differentially expressed in arthritis, more preferably at least about 100 genes that are 
identified as differentially expressed in arthritis, and the like. 

[0014] The genes identified can be expressed at least about 1.5 fold higher or lower 
than normal, at least about 2 fold higher or lower than normal, at least about 3 fold higher or lower 
than normal, and the like. 

[0015] The genes can include, for example, the 385 genes or ESTs in Table 1 (SEQ ID 
NOS:l-385), homologs, variant thereof, and the like. The genes can include the genes in cluster A, 
and in embodiments the genes in cluster A can be down-regulated (SEQ ID NOS:l-37) at least 
about 2 fold, for example. Further, the genes can include the genes in cluster B, and in 
embodiments the genes in cluster B can be up-regulated (SEQ ID NOS:l-37) at least about 2 fold 
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only in late or severe disease, for example. The genes can include the genes in cluster C, and in 
embodiments the genes in cluster C can be up-regulated (SEQ ID NOS:l-37) at least about 2 fold 
only in early or mild disease, for example. Also, the genes can include the genes in cluster D, and 
in embodiments the genes in cluster D can be up-regulated (SEQ ID NOS: 1-37) at least about 2 fold 
in early or mild disease and more in late or severe disease, for example. Furthermore, genes can 
include the genes in cluster E, and in embodiments the genes in cluster E can be up-regulated (SEQ 
ID NOS:l-37) at least about 2 fold in both early or mile and late or severe disease, for example. 

[00161 Also, the differentially expressed genes can include the 385 genes identified as 
SEQ ID NOS:l-385, for example. If the genes in clusters B or D are upregulated, the disease can 
be diagnosed as severe. Furthermore, if the genes in cluster A are upregulated, the disease can be 
diagnosed as moderate to low-grade. 

(00171 Further, the gene expression of at least about 70% of the genes correlates witia 
that of the gene signature, preferably, the gene expression of at least about 80% of the genes 
correlates with that of the gene signature, more preferably, the gene expression of at least about 
90% of the genes conrelates with that of the gene signature, still more preferably, the gene 
expression of at least about 95% of the genes correlates with that of the gene signature, and the like. 

[00181 Aspects and embodiments of the invention further provide methods for the 
treatment of RA that include down-regulating at least one of the genes identified in clustors B 
through D. Such down-regulation can be achieved by adding antisense oligonucleotides specific for 
the gene that is being down-regulated, or by adding or expressing a repressor of the gene that is 
being down-regulated. 

[0019] In oflier embodiments, the invention provides methods for the treatment of RA 
which involve up-regulating at least one of the genes in cluster A, for example, by adding or 
expressing a transcriptional activator of the gene fliat is being iq)-regulated, or by adding a vector 
that expresses the protein encoded by the gene that is being up-regulated. 

[0020] Further aspects and embodimetns of the invention provide methods for the 
identification of genes for targeting in the treatment of rheumatoid arthritis in a mammal other than 
a mouse, which methods involve identifying homologs of SEQ ID NOS: 1-385. 

[0021] Still other aspects and embodimetns of the invention include methods for the 
diagnosis of rheimiatoid arthritis in a mammal, the methods including obtaining a tissue or fluid 
sample from a diseased patient; isolating mRNA fi-om said sample; using the isolated mRNA to 
analyze the gene expression of at least about 40 genes, selected from the group consisting of SEQ 
ID NOS: 1-385 or a homolog thereof, obtaining a fingerprint of the patient's gene expression; and 
identifying whether at least about 60% of said fingerprint is at least about 2 fold differentially 
expressed from that of a normal patient. 

[0022] Other embodiments include an array or a genechip, specific for rheumatoid 
arthritis, including at least 10 of the genes selected from the group consisting of SEQ ID N0S:1- 
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385 or homologs thereof. The array or genechip can include at least 40, 50, 75, 100, or more, of the 
genes selected from the group consisting of SEQ ID NOS:l-385 or homologs thereof. In some 
embodiments, the array or genechip consists essentially of such genes, including up to all of the 
genes of SEQ ID NOS: 1-385 or homologs thereof Such genes can allow for the identification of 
the severity of the disease, the prognosis of the disease, the diagnosis of the disease, the most 
efficacious treatment of the disease in a specific patient^ and the like. 

[0023] In other embodiments, the invention provides methods for the diagnosis or 
analyses of autoimmune disease or rheumatoid arthritis, including: obtaining mKNA from a 
patient; using the mRNA as a probe for the analysis of the arrays or genechips disclosed herein; and 
comparing the results obtained with those of a normal patient. 

[0024] Additional embodiments and aspects provide methods of screening the efficacy 
of a candidate drug in vitro for the treatment of collagen-induced arthritis including: identifying 
vascular endothelial cells expressing FARP mRNA and protein; introducing a candidate drug to 
said endothelial cells; and evaluating whether said candidate drug causes raihanced or normalized 
apoptosis of vascular endothelial cells. 

[0025] Further, the invention in some embodiments provides methods and materials 
for reducing the symptoms associated with collagen-induced arthritis including: identifying a 
subject suffering from collagen-induced arthritis; and administering a compound effective to 
deplete at least one of the group of FARP mRNA, FARP protein, FARP receptor binding, and 
FARP activity. Such compound can include, for example, an anti-FARP antibody, capable of 
interfering with binding of FARP to a FARP receptor. 

Brief Description of the Drawings 
[0026] Figure 1. Hierarchical cluster analysis of 385 genes differentially expressed 
during CIA. The left panel shows the distribution of gene expression across the hierarchical tree 
structure in which the values for the first nonnal sample (1) are set to 1. Rows represent individual 
genes; columns represent individual values of duplicate samples for each experimental time point 
Each cell in the matrix represents the expression level of a single transcript with red and green 
indicating transcript levels above and below the normal values for that gene across all samples, 
respectively. The color code for the signal strength in the classification scheme is shown in the box 
at the bottom left of the panel. Color intensity from pale to deep indicates trust values for the 
expression of each specific transcript. The colored side bar indicates the five basic clusters of gene 
expression, with letters corresponding to their grouping. The mean values of all the genes within 
the indicated groups (A-E) are graphed on the right. 

[0027] Figure 2. Comparison of microarray and RT-PCR analyses of representative 
genes in CIA. The patterns of IL-2Ry and follistatin-like gene mRNA levels, determined by DNA 
microarray analysis from pooled RNA, are compared to patterns determined by real time RT-PCR 
analysis of two individual RNA samples. 
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[0028] Figure 3. IL2-RY is expressed in the synovial tissue during collagen-induced 
arthritis. Panel A (dark-field illumination) and panel B (bright-field illumination) show a section 
through the joint from a normal mouse paw. There is no signal in the joint tissue or surrounding 
periosteal tissue. Panels C and D show a section through a CIA mouse paw 28 days following 
primary CII immunization. There is positive signal (bright white grains) in the synovial tissue 
(arrow) indicating the presence of RNA transcripts for IL2-R7. Panels E and F represent a section 
through the paw of a CIA mouse 49 days following primary CII immunization. There is an 
extensive chronic inflammatory reaction in the tissue (*) surrounding the cortical bone tissue. 
Despite the chronic inflammatory reaction in the tissue no significant IL2-Ry is present in the lesion 
late in disease. (PO periosteal; CB cortical bone; SY synovium; AS articular surface; Mag lOOX) 

[00291 Figure 4. Tissue-specific expression of differentially regulated genes in 
lymphoid organs and cells. The presence of specific gene sequences in cDNA libraries generated 
fi-om the indicated tissues was obtained from the NCBI database using the LocusLink and Unigene 
databases. 

[0030] Figures. Classification ofselected annotated genes. Bars indicate the number 
of the characterized genes that are involved in the specified biological function (A) or pathway (B). 
The number of genes in each of the five expression patterns is indicated on each bar. Some genes 
are represented in more than one category. 

Brief Description of Tables 1 and 2 

[0031] As mentioned above, filed herewith on two con:ipact discs are two copies of 
Table 1, including Tables 1.1-1.3, and Table 2, including Tables 2.1-2.3. The compact discs are 
labeled as "Copy 1" and "Copy 2.'* Each disc has identical content The contents of the discs are 
hereby incorporated by reference in their entireties. 

10032] Table 1 Listing of mouse gene accession numbers, mouse gene name, human 
mRNA homolog, human protein homologs, and Genbank source of human homolog information. 
These genes are divided into clusters A through E by expression characteristics as explained herein. 
Human homologs were identified using unigene and homologene functions at the NCBI database. 
Further information on the homologous human mRNA sequences can be found in Table 1.1 under 
the accession number of interest. Similarly, further information on the homologous human protein 
sequences can be found in Table 1.2, and further information on the "Genbank source" can be 
found in Table 13, 

[0033] Table 2 Listing of relevant ESTs. The ESTs are grouped into clusters A 
through E. as explained herein. Listed are the name of the gene (if known), the accession number 
of the corresponding homologous human mRNA (if known), the Genbank source number of the 
human mRNA information, the Genbank accession number for the mouse gene, and a description of 
similar genes, if known. Further information on the homologous human mRNA sequences 
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corresponding to the ESTs can be found in Table 2.1, under the accession number of interest. 
Similarly, further information relating to the Genbank source number (human) can be found in 
Table 2.2, and information corresponding to the Genbank accession numbers (mouse) can be found 
in Table 2.3. 

Detailed Description of the Preferre d Embodiment 
[00341 Using microarray analysis, the expression of 8734 cDNAs was analyzed during 
various stages of mouse CIA, an animal model of RA. From the results, a method for the diagnosis 
and treatment of RA was developed. Of the 8,734 genes analyzed, 330 were induced and 55 were 
down-regulated greater than two-fold in early or late diseased paws, as compared to normal paws. 
Hierarchical clustering resulted in five distinct expression patterns that correlated with 
histopathologic changes in the paw. Of the 385 genes, the identities of 240 are known. These 
genes are biologically classifiable into 19 functional categories, the largest being immunity and 
defense, and into 20 pathway categories, including membrane, secreted and extracellular. Of the 
known genes, the majority have not been described as playing a role in arthritis. Many of ttiese 
genes are involved in cell proliferation, differentiation, tumorigenesis, apoptosis, and inflammation. 
Thus, these global geme expression patterns in diseased paws reveal a large number of genes novel 
to arthritis, and distinct gene expression profiles distinguishing early and late CIA whose fiirflier 
characterization will advance the understanding of the basic mechanisms responsible for arthritis. 

[0035) The results of the analysis of tiie mouse model of RA include a set of 
differentially expressed genes that can be used for a variety of purposes. The set of differentially 
expressed genes can be thought of as a "signature" or a "fmgerprinf ' of RA. Thus, some 
embodiments of the present invention include DNA arrays or genechips that include one or more of 
the differentially expressed mouse or human genes identified herem. Further embodimrats can 
include a specific subset of the differentially expressed genes that can represent, for example, genes 
that are only up-regulated in late disease or genes that are only up-regulated in eariy disease. A 
"human Rheumatoid Arthritis genechip" c?n be used to fiirther study the gene expression of RA as 
well as other auto-immune diseases, in animal models or in human patients. 

[00361 The results of the analysis of the mouse model of RA are also useful in 
identifying and developing various embodiments of a "human Rheumatoid Arthritis genechip" 
which includes human homologs of the mouse genes identified herein as well as independently 
identified genes. The chip and the information obtained can be used to develop methods for 
diagnosis, prognosis, and analysis of the efficacy of treatments, 

[0037] The analysis of mouse genes herein is believed to have covered approximately 
one third of the genes typically expressed in the mouse genome (a comparable number to that 
expressed in the human genome). Thus, one embodiment is a method for the identification of other 
mouse genes involved in RA. In order to thoroughly identify the genes that are differentially 
expressed in the mouse, arrays or genechips that include a thorough representation of mouse 
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mRNAs are analyzed using the same method of analysis that identified the RA-specific genes 
identified herein. However, using the genes identified in the initial analysis of 8734 genes, human 
or other mammalian homologs can be identified and the differential expression confirmed. The 
method is also useful for further identifying genes that are up- and down-regulated in human or 
other mammahan RA and related conditions. Numerous human homologs of the mouse genes are 
also differentially regulated in human RA comparably to the differential regulation in mouse CIA. 

[0038] Thus a method is described herein that identifies the pattern of specific 
differentially expressed genes, also referred to as the "signature" or "fingerprint" for a particular 
disease state or a particular patient. The signature is used to diagnose RA in a patient and to 
analyze the severity of the disease. The pattern of specifically up and down-regulated genes is 
compared to a "normal" patient, a patient who does not have RA. 

[00391 Briefly, genes that are differentially regulated firom the normal in patients with 
RA are identified by any method known to one of skill in the art. With identification of genes 
involved in the disease and progression of RA, the genetic data are useful in developing a number 
of methods for use on a patient who has or may have RA or other arthritides. 

[0040] Preferred methods involve the identification of the signature of differential 
expression of one or more of the identified genes for a specific patient. In some embodiments, the 
method includes isolation of mRNA firom a diseased tissue, blood sample, or synovial fluid sample 
fi-om a patient. The expression of the genes that are specifically identified as differentially 
regulated is analyzed. The "signature" is produced as the pattern of up and down-regulated genes 
wittiin that patient's sample. The signature can be used for diagnostic methods, for prognostic 
methods, for analysis of the most efficacious treatment for the patient, and for analysis of the 
efficacy of the treatment or the progression of the disease. 
Mentifvinff human genes that are differentially regulated in RA 

[00411 In some embodiments, the genes that are differentially regulated in human RA 
are identified by a) using mouse genes associated wifli CIA to identify human and/or other 
mammalian homologs thereof using database comparisons, b) using mouse genes associated with 
CIA to isolate homologs fi-om gene libraries of an animal of interest and/or c) using genes that are 
known to be involved in mammalian RA and mammalian homologs of those genes. 

[0042] In a further embodiment, the genes that are differentially regulated in 
mammalian RA are identified by microarray analysis using mRNAs firom a mammal with RA, 
using a method comparable to that used herein for identification of the mouse genes. Preferably, 
the methods identify a thorough representation of the genes involved in RA by one method or 
another. 

[0043] In some embodiments, the mRNAs fi-om the mammal with RA are obtained 
fi-om a tissue, biological fluid or mixture thereof that contains mRNA. In further embodiments, the 
mRNAs are isolated firom diseased synovial tissue or synovial fluid. In still fiirther embodiments. 
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the mRNAs are isolated from a blood sample, a saliva sample, or a urine sample. In preferred 
embodiments, a patient sample is used for which the expression of genes is altered due to the 



[0044] Homologs can be genes or DNAs that are 40% similar or more to the mouse 
genes identified, alternatively, the homologs are at least 50% similar, including 55% similar, 60% 
similar, 65% similar, 70% similar, 75% similar, 80% similar, 85% similar, 90% similar, 95% 
similar, and 99% similar. Homologs that are more similar are generally most closely related to the 
mouse sequence, and thus are in many cases most likely to exhibit similar differential expression in 
RA. However, the amount of similarity can vary depending on the importance of the i-egion of the 
gene identified. For example, if the mouse gene is a kinase, the kinase regions are likely to be more 
homologous or similar then the other regions. The homologs can be DNAs that hybridize under 
stringent conditions to the mouse genes identified. The stringent conditions under which a 
homologous gene or DNA will hybridize with the mouse gene can be defined as follows: O.IX 
SSPE, 0.1% SDS wash solution at 65«C with 2 washes. (IX SSPE is 180 mM NaQ, 10 naM 
NaH^P04, 1 mM EDTA (pH 7.4)). The identification of mammalian homologs can be 
accomplished using any method known to one of skill in the art. Any genes that have been 
identified or will be identified as being involved in the disease can be included. Certain genes 
having a more central or "important" role in different aspects of the disease are thus identifiable. 
Thus, the subset of genes that are analyzed or contained in a microarray or genechip can be chosen 
based on flie direct or indirect role the gene is found to play in the disease. Alternatively, subsets 
can be chosen based on what aspect of the disease is being tested. Thus, in some embodiments, 
those genes that are identified as being involved m "activating" the disease will be included 
particularly when diagnosis is the desired result In a further embodiment, those genes that are 
identified as involved in ••progression" of the disease will be included, particularly when treatment, 
prognosis, or staging of disease is being analyzed. In a fiirther embodiment, those genes involved 
in remission, regression, or healing of the disease are included, particularly when prognosis, 
efficacy of treatment, and/or staging of the disease are being analyzed. 

[00451 The above metiiod can be altered and applied to all mammals. Thus, in some 
embodiments, the patient is a mammal. In a fiirther embodiment, the mammal is a human, primate, 
dog, cat, or horse. Because the incidence of RA in humans is particularly significant, some 
embodiments include methods for the diagnosis, prognosis and analysis of human RA. Hxraian 
homologs are identified by methods known to those of skill in the art. hi one embodiment, human 
homologs are identified using computer programs that search for "closest homologs" by inputting 
the mouse genes and ESTs identified herein. In a further embodiment, the computer analysis can 
use "active" portions of the sequences or those parts of the gene sequences that are known to be 
more highly conserved between mammals. The portions that are more highly conserved can be 
involved in the activity of the protein expressed therefi-om. A variety of computer programs can be 
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used to identify the closest mammalian homologs. In many cases, there can be more than one 
human homolog that corresponds to the mouse gene. 

[00461 In a further embodiment, human homologs are identified by performing the 
microarray analysis that was used to identify the mouse genes herein. In preferred embodiments, a 
thorough representation of tlie human genes that are expressed is analyzed. For example, it is 
beheved that approximately 100,000 genes are actively expressed or included in the human 
genome. Thus, in order to thoroughly identify those that are involved in the disease RA, a complete 
representation of the approximately 100,000 genes are analyzed. For example, one or more arrays 
that contain a thorough representation of the human genome are used to analyze gene expression. 
In one embodiment, the arrays are from one or more tissues or fluids. In a further embodiment, the 
arrays are analyzed in duplicate, in triplicate, or in multiple copies. In one embodiment, differential 
expression can be identified as at least about a 1.4 to 2 fold difference in expression from normal. 
In a further embodiment, the differential expression is identified as about a 1.6 to 2 fold difiference 
in expression. In a further embodiment, the genes are identified as dififerentially expressed in RA 
when there is at least about a 2 fold difference in expression from normal. In a further embodiment, 
the genes are identified as differentially expressed in RA when there is at least about a 2.3 fold 
difference in expression from normal. In a further embodiment, the genes are identified as 
differentially expressed in RA when there is at least about a 2.5 fold difiference in expression from 
normal, including at least about 2.6 fold, 2.7 fold, 2.8 fold, 2.9 fold, 3 fold, 3.5 fold. 4 fold, and 5 
fold. However, some genes can show a higher difference in expression than oflicrs. These genes 
can be more involved or alternatively, equally involved in the manifestation of disease as a gene 
that is less differentially expressed. 

[0047] From the above analysis, a "signature" or "fingerprint" can be produced that 
includes the genes that are differentially expressed in the disease and tfie range of expression that 
can be seen among different patients. In one embodiment, the differential expression can be due to 
different aspects and manifestations of the disease. For example, the fingerprint can be a 
fingerprint of early RA, late RA, mild RA, extreme RA, RA in remission, a manifestation of RA 
with litfle pain, but considerable deformity, a manifestation of RA wifli considerable pain, but little 
deformity, etc. 

[0048] The expression of many of the genes identified is confirmed using alternative 
methods ioiown to one of skill in the art, including Northern blotting, quantitative PGR techniques 
such as real-time PGR, or oAer methods of expression analysis. Alternatively, the translation 
products and expression can be analyzed by methods known to one of skill in the art, such as 
Western blotting, activity assays, etc. 

[0049] In a further embodiment, the genes identified as part of the "signature" or 
"fingerprint" are further analyzed as to their involvement in the disease. In one embodiment, a gene 
is further analyzed by any method known to one of skill in the art and can identify the involvement 
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in activation, progression, pain manifestation, deformation, and treatment of the disease. Patients 
that express certain genes or subsets identified above will often show a greater response to certain 
types of treatments then others. For example, if one patient expresses high amounts of IL-2, that 
patient would respond better to treatments that target IL-2 activity, expression, or the downstream 
effects of IL-2. 

[00501 One embodiment of this "signature" or "fingerprint" is an array or a genechip 
that includes the genes that are identified as differentially expressed in one or all manifestations of 
RA, which can be referred to as a "human Rheumatoid Arthritis genechip." A variety of genechips 
can be produced that are specific to different aspects of the disease. In one embodiment, a genechip 
can be produced with only those genes that are identified as possessing key roles in each aspect of 
the disease. In a further embodiment, a genechip can be produced that includes only those genes 
that are expressed late in disease or in severe disease. 

Method of Hia pnnsis. prpgnnsis, and t r patmCTt analysis of a patient With rheim^toid arthritis 

[0051] The genes that are identified above as being involved in RA can be analyzed as 
to differential expression in a specific patient by any means loiown to one of skill in the art. Some 
embodiments involve isolation of the mRNA firom a patient sample. 

[0052] Briefly, mRNA is isolated from at least one tissue or sample firom the patient 
In one embodiment, the sample is a diseased tissue sample, including but not limited to synovial 
tissue. In a fiirther embodiment, tlie sample is a fluid containing disease cells or mRNA, including, 
but not limited to, synovial fluid, and blood. 

[0053] The mRNA can then be used to analyze gene expression by any method known 
to one of skill in the art. In one embodiment, the mRNA is used to analyze a '"human Rheumatoid 
Arthritis genechip" or array. From this analysis, a specific patient "signature" of the genes and 
amount of differential expression is produced. The amount of differential expression is compared 
to a normal patient In one embodiment, the ranges and values of expression for a normal patient 
are derived using at least 2 normal patients, including at least 3, at least 4, at least 5, at least 10, at 
least 20, and at least 50. In a fiirther embodiment, the ranges and values of expression for a normal 
patient arc derived using a statistical sampling of the population, or a statistical sampling of the 
area, ethnic group, age group, social group, or sex. In a fiirther embodiment, the range and values 
of gene expression for a normal patient are derived fi-om the patient before disease or during 
remission. 

[0054] The results of the signature can be used in any one or more of the methods 
disclosed herein. Alternatively, one or more of the analyses can be included in one chip or array. 
The specific signature can include the results of the expression levels of one or more genes in that 
specific patient. In one embodiment, the signature is the results of the expression levels of at least 
10 genes, preferably 40 genes, however, the signature can include the results of 50, 60, 70, 80, 90, 
100, 150. 200, 250, 500, 750, 1000, 2000, 5000, and 10, 000 genes which have been identified as 
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being differentially expressed in RA. Some genes are more important or more involved in the 
manifestation or activation of the disease. Thus, the signature can require fewer genes when those 
that are more impoitant have been identified and included. 

[0055] In one embodiment, the results of the signature are used in a method of 
diagnosis. The method of diagnosis can include, a method of diagnosis of rheumatoid arthritis, a 
method of diagnosis of severity of the disease, a method of diagnosis of a manifestation of the 
disease and can include any or all of the above. Many of the same genes that are differentially 
expressed or involved in the manifestation of RA can also be involved in a different autoimmime 
disease. Alternatively, many of the same genes that are differentially expressed or involved in the 
manifestation of RA can also be involved in a different arthritide. Thus, the method of diagnosis 
can diagnose an arthritic or autoimmune disease, including, but not limited to. Lupus, Juvemle RA, 
Ankylosing Spondylitis, gout, osteoarthritis, fibrositis and fibromyalgia, Scleroderma, and even the 
autoimmune manifestations of Lyme disease and Streptococcus infection. 

[0056] In a further embodiment, the results of the signature can be used in a method 
for prognosis of disease. The prognosis in various patients can vary tremendously. Some patients 
may progress very rapidly and may need a very aggressive treatment plan. Other patients may have 
a very mild version and may progress very slowly, requiring a more subtle treatment plan. This can 
be important when considering side effects, quality of life, and patient needs. 

[0057] In a further embodiment, the results of ttie signature are used in a method of 
identification of the most efficacious treatment for that specific disease and for that specific patient 
The treatment and the response to a drug can depend on which genes are being expressed. For 
example, in its most simple form, a patient with little IL-2 expression would not be best treated 
using a treatment that targets IL-2, However, the choice of a treatment method can involve a 
number of fectors besides the gene expression of specific genes, including, the form of the disease, 
the severity of the disease, the manifestation of the disease, and the needs and wants of the patient. 
Many of these factors can be identified using one of the methods included herein. 

[0058] In a further embodiment, flie results are used to identify single nucleotide 
polymorphisms (SNPs), mutations, or Restriction Fragment Length Polymorphisms (RFLPs) 
associated with RA or other autoimmune diseases or other arthritides. The genes that are identified 
can be included in one or all of the genechips, arrays or analyses herein. In an alternative 
embodiment, a genechip that includes single nucleotide polymorphisms (SNPs), mutations, or 
Restriction Fragment Length Polymorphisms (RFLPs) is produced and used for diagnosis, 
prognosis, and/or identification of the best treatment or drug for use in treating RA. 
Method of Identifying Targets for Drugs 

[0059] In a further embodiment, the results of the signature are used to identify drug 
targets. Any or all of the genes identified herein and included in the signature or on a rheumatoid 
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arthritis array can be used to further identify drugs or treatments that would target that gene or gene 
product. 

[0060] Methods of identifying targets can include any method Icnown to one of skill in 
the art, including, but not limited to: producing and testing small molecules, oligonucleotides 
(including antisense, RNAi and triplex formers), antibodies, and drugs that target any of the genes 
or gene products identified herein. Alternatively, gene therapy can be used to down-regulate, up- 
regulate, or express proteins or gene products identified herein. 

[00611 The present methods will be further described by use of the following 

examples, 

[0062] In some of the following examples, the paws of niice with collagen-induced 
arthritis were analyzed in early disease and late disease by isolation of the RNA and microarray 
analysis. The results were confirmed using RT-PCR and in situ hybridization. Down- and up- 
regulation of genes was identified and the genes were clustered into groups. Human homologs are 
identified and the expression patterns are used to diagnose RA, to analyze the severity of disease in 
a patient, and to identify new treatments for arthritis. A number of genes were identified that 
previously had not been identified as being involved in arfliritis; the genes thus identified can 
represent gene targets for drug therapy. 

[0063) In the Examples relating to mouse experiments, DBA/1 mice were immunized 
with type H bovine collagen to induce arthritis, and mRNA was isolated firom paws of non- 
immunized mice and from severely affected paws of mice at 28 days (acute disease model) and 49 
days (chronic disease model) following the primary collagen injection. A single common reference 
control was used for all microarrays consisting of mRNA derived from the whole of a postnatal day 
1 mouse, and all mRNAs were hybridized to duplicate microarrays (Incyte Phannaceuticals, Inc., 
Palo Alto, CA). Among the 385 disease-specific genes differentially regulated in CIA are 102 
expressed sequence tags (ESTs). Microarray analyses will help in further mapping out differences 
in gene expression between nonnal synovium and the synovium of acute and chronic CIA, 
including the identification of novel genes involved in arthritis. 

Example 1 

P ypductjon of mice with collagen-induced arthritis (CIA^ 
[0064] Mice with collagen-induced arthritis were used as a model for RA. Male 
DBA/IJ mice, 6 to 8 weeks of age, were purchased from The Jackson Laboratory (Bar Harbor, ME). 
Mice were housed in the animal care facility at The Children's Hospital Research Foundation 
(Cincinnati, OH) under Institutional Animal Care and Use Committee approved conditions. 
Arthritis was induced with bovine type H collagen (CII, Elastin Products Co., Owensville, MO), as 
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previously described (Thomton, et al. J. Immunol (2000) 165:1557-1563). Briefly, mice were 
injected intrademally with 100 ng of CH in complete Freund's adjuvant (CFA) at the base of the 
tail on day 0, and a similar booster was administered on day 21. Mice were evaluated for arthritis 
using an established macroscopic scoring system ranging from 0 to 4 (0 = no detectable arthritis, 1 
= swelling and/or redness of paw or one digit, 2 = two joints involved, 3 = three or four joints 
involved and 4 = severe arthritis of the entire paw and digits). At day 28 (early disease) and day 49 
(late disease) following primary immunization, mice were sacrificed. Hind paws with an arthritic 
score of four were removed for mRNA analysis and in situ hybridizations (ISH). Paws from mice 
of the same age not treated with CII were used as normal controls. 

Example 2 

mRNA expre ssinn profiling of eariv and late CIA 
[0065] Differential gene expression in paws of mice with CIA was analy2Bd in early 
(day 28) and late (day 49) arthritis and compared to that of paws from normal mice. These time 
points were chosen based on earlier studies that demonstrated their correlation with distinct 
histologic appearance and mRNA expression patterns by RPA. 

[00661 RNA was isolated from paws that were quick frozen in liquid nitrogen and 
stored at -SCC. Frozen paws were minced with a scalpel and homogenized wife a Polytron Tissue 
Tearor (Biospec Products, Bartlesville, OK) in appropriate volumes of RNA Stat-60 (Tel-Test, 
Friendswood, TX). Total RNA was extracted from the tissue homogenates according to Ihe 
manufacturer's instructions. Pooled total RNA from normal (4 paws), early arthritic (3 paws) and 
late arthritic (4 paws) paws was used to isohite polyA+ RNA by the Oligotex mRNA isolation kit 
(Qiagen, Valencia, CA) according to the manufacturer's instructions. RNA concentrations were 
measured by fluorometry using flie Ribogreen RNA Quantification Kit (Molecular Probes. Inc.. 
Ei^ene, OR). 

(00671 DNA micioarray analysis was performed as follows; mRNA of a whole 1 day 
old mouse was used for normalization of gene expression levels across all six microarray chips. 
Competitive hybridizations with Cy3 labeled whole 1 day old mouse mRNA versus Cy5 labeled 
normal paw mRNA, Cy5 labeled early paw mRNA or Cy5 labeled late paw mRNA were 
performed. Each sample (nonnal, early and late) was labeled and hybridized to two microarray 
chips. Hybridizations ware performed on the mouse GEMl array by hicyte Genomics (Palo Alto, 
CA). 

[0068] Primary data were examined using Incyte Gemtools software and GeneSpring 
version 4.0.4 software (Silicon Genetics, Redwood City, CA). Defective cDNA spots (irregular 
geometry, scratched, or <40% area compared to average) or spot fluorescence hybridizations with 
signal to noise ratios less than 2.5:1 were eliminated from the data set. Data sets were subjected to 
normalization fast within each microarray experiment such that the median of the Cy5 channel was 
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balanced against the ratio of the Cy3 channel ac*(MedianCy3) = MedianCyS, where k is the ratio of 
the median intensities in each).. Each microarray contained control genes present as non- 
manmialian single gene "spikes" or "complex targets". The complex targets consisted of probe-sets 
that contain a pool of cellular genes expressed in most cell types. In addition, each experimental 
mRNA sample was augmented with incremental amounts of non-mammalian gene RNA (2X, 4X, 
16X, etc) to permit assessment of the dynamic range attained within each microarray. Little 
variation was observed across the microarray series with respect to the 192 control genes (not 
shown), providing support for inter-anray comparisons of temporally regulated genes. Genes were 
clustered according to their expression pattern by subjecting the log-transformed data (R= 
log2Cy5/(lcCy3), where R is the log of the expression ratio for each gene) to the hierarchical tree 
clustering algorithm as implemented in the GeneSpring program (Silicon Genetics). The 
hierarchical tree analysis was performed using a minimum distance value of 0.001, separation ratio 
of 0.5 and the standard correlation distance definition. 

[00691 Mouse sense and antisense RNA probes were synthesized using the Stratagene 
RNA Transcription Kit (Stratagene, La JoUa. CA). T3 or T7 RNA polymerase produced ''S- 
radiolabeled antisense or sense single-stranded RNA probes, respectively. A sense probe generated 
from an unrelated mouse gene was used as a negative control for in situ hybridization. 

[0070] For early and late disease, mRNA from paws with severe arthritis (score of 4) 
were used to generate probes that were hybridized to Incyte Mouse GEMl chips, as was mRNA 
from normal mouse paws. Hybridizations were conducted on duplicate chips, allowing for flie 
elimination of genes whose expression levels differed by greater than 50% between the duplicate 
samples. 8,734 cDNAs, including known genes and ESTs, ware represented on the microarray 
chip. 385 genes exhibited a greater than two-fold difference in expression between arthritic and 
normal paws and were selected for further analysis. Expression of 304 of these genes differed only 
between arthritic and normal paws, and expression of 81 of these genes differed between early and 
late arthritis. However, some of the genes identified were duplicates. Thus, the genes listed in 
Table 1 include some duplicates. 

[OOTIJ Figure 1 demonstrates the 385 selected genes and their average levels of 
expression as compared to nonnal tissue values. The majority of genes were more highly expressed 
in arthritic paws as compared to normal paws. Genes were clustered according to their expression 
pattern during disease by hierarchical tree analysis. The resulting hierarchical tree structure 
revealed five distinct patterns of expression. Approximately half of the genes, represented by 
clusters D and E in Table 1 (225 genes, 58.4%), were upregulated both in early and late disease. It 
was possible to separate these genes into those with similar expression levels in early and late 
disease (cluster E in Table 1) and genes whose expression levels further increased duiing late 
disease (cluster D in Table I). These may represent two distinct patterns or a continuum of 
coordinated regulated gene groups. Cluster C in Table 1 (105 genes, 27.3%) represents genes 



-15- 



wo 03/072827 



PCT/US02/35433 



principally uprcgulated in early disease. Cluster B in Table 1 (18 genes, 4.7%) represents genes 
predominantly upregulated in late disease. Cluster A in Table 1 (37 genes, 9.6%) represents genes 
downregulated during both early and late disease, compared to normal paws. The individual genes 
and the number of ESTs belonging to each cluster are listed in Table 1. Please see Table 2 for the 
EST accession number and Table 3 for a schematic representation of the characteristics of Clusters 
A through E. 



Table 1 Sequp ^nnes Human Homologs and AccessiQH Nupiber? 



Cluster A; 

Mouse # Name (mouse) 



Human 
mRNA# 



Human 
Protein # 



Genbank Source 



W09829 trefoil factor 2 (spasmolytic 

protein 1) 
W36838 uteroglobin 
AA028678 palate, lung, and nasal epitlielium 

expressed transcript 
AA047966 four and a half LIM domains 1 
AAl 08401 solute canrier family 27 (fatty acid 

transporter) 
A A 145 089 potassium voltage-gated channel, 

subfamily H, member 2 
AA241859 betaine-homocysteine 

methyltransferase 
AA271284 myoglobin 

AA261313 nuclear receptor subfamily 1, 

group H, member 4 
AA275042 amine N-sulfotransferase 
AA268 1 20 cytochrome P450, steroid 

inducible 3al 1 
AA5O1052 cardiac morphogenesis 



NM 005423 NP_005414 AH003622 



NM 003357 
NM_016583 

NM_001449 
NMJ03645 

NM_000238 

NMJ01713 

NM 005368 



NP_003348 
NP_057667 

NP 001440 
NPIo03636 

NP_000229 

NP_^001704 

NP 005359 



NM_005123 NP_005114 

NM_.001054.1 NP_001045 
NM 007818 NP 001045 



BC0044ai 
BC012549 

BC010998 
D88308 

U04270 

U50929 

X00371,X00372 

X00373 

U68233 

59% homologous 

X 60452 

62% homologous 
AW755250 



Mouse # Name (mouse) 

WII965 enolase 3, J3 muscle 

W64550 tumor-associated calcium signal 

transducer 2 
AA388939 IG a chain C region 
W34420 ATPase, Ca-H- transporting, 

cardiac muscle, fast twitch 1 
AA015155 SlOO calcium binding protein A3 
AA204246 Mus musculus dystonin (Bpagl- 

n) 

W62819 neuronal protein 3.1 

W6493 7 angiopoietin related protein 2 



Human 
mRNA# 



Human 
Protein # 



NM^001976 
NM_002353 

NM_001810 
NmIo04320 

NM 002960 
NM 001723 



NP_0O1967 
NP„002344 

NP_001801 
NP_004311 

NP_002951 
NP 001714 



NM 004772 NP 004763 



Genbank Source 



X5 1957^X56832 
X77753 

AL109804 
AH005190 

Z18948 

L11690,M69225 

U30521 

AF007150,AI467954. 
AI081 
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Mouse # Name (mouse) 



Human 
mRNA# 



Human 
Protein # 



Genbank Source 



W82 159 Fc receptor, IgG, low affinity III 
AI894016 complement component 1, q 

subcomponent, c 
AI324436 Phospholipase A2 group VII 
AA116505 CD53 antigen 
AA 1 777 1 7 Interleukin 1 receptor, type I 
AI322933 Interleukin 4 receptor, a 
AA220007 CD68 antigen 
AA289476 Chemokine (C-C) receptor 2 
AA289559 Ecotropic viral integration site 2 
AI508758 CD14 antigen 
AA286506 Interleukin 4 receptor, a 
AA467489 IntegrinP 2 (Cdl8) 
AA423373 Glycoprotein 49 A 
AA435060 Leucocyte specific transcript 1 
AA475774 Cathepsin C 
AA497620 small proline-rich protein 2A 
W 11 889 Hemochromatosis 
W 1 3905 Fibrinogen/angiopoietin-related 

protein 
W969 1 4 lysy 1 oxidase 
AA08023 1 mannosidase 2, a Bl 
AA152885 small inducible cytokine 

subfamily B (Cys-X-Cys) 
AA170386 colony stimulating factor 2 

receptor, (3 2, low-affinity 
AA201 097 protein tyrosine phosphatase, 

receptor type, C 
A A 1 97349 baculoviral lAP repeat-containing 

2 

AA239171 elastin 

AA268708 Mus musculus hypoxia induced 

gene 2 (Hig2) 
AA259959 CD3 7 antigen 
AA26052 1 uncoupling protein 2, 

mitochondrkl 
AA274 1 04 interleukin 2 receptor, y chain 
AA387058 apoptotic protease activating 

factor 1 

AA547555 CDC28 protein kinase 1 

AA 1 40523 Rac GTPase-activating protein 1 

AA521764 receptor (calcitonin) activity 

modifying protein 2 
AA05 1 654 metallothionein 1 
AA265259 oncostatin receptor 
AA178121 cathepsin S 



NP_000561 
XP_031238 

NP_005075 
NP_000551 
NP_000868 
NP_000409 
NP_001242 
NP_000638 
NP_055025 
NP_000582 
NP_000409 
NP_000202 
NP_077294 

NP_001805 
NP_005979 
NP_0004Ol 
NP_057193 

NP 002308 
NPJ00519 
NP 006417 



NM_000570 
XM_^031238 

NM 005084 
NMIo00560 
NM 000877 
NM~000418 
NM_001251 
NM_000647 
NM„014210 
NM 000591 
NMIo00418 
NM__000211 
NM_024318 

NM_001814 
NM 005988 
NM_000410 
NM^O 16109 

NM_002317 
NM 000528 
NM„006419 

No human 
genes 

NM_002838 NPJ02829 Y00062,Y00638 

NM_001166 NP_.001157 L49431,U37547 

NP 000492 AH007100 



X16863 
AK057792, 
BC009016 
U20157 

AHOl 1005^^37033 
M27492 
X52425 
S57235 
U03882 
AH002689 
X13334 
X52425 
Ml 5395 
AF041262 
LY117 

AU076460,X87212 
X53064 
U60319 

AF153606,AF202636 

AF039290,AF039291 
AH006687 
47% homologous 
AF044197 



NM_000501 
No human 
genes 

NM 001774 
NM_003355 

NM_000206 
NM 013229 

NM_001826 
NM_013277 
NM_005854 

M 10942 

NM_003999 

NM_004079 

NM 003816 



AA2 1 0306 a disiutegrin and 

metalloproteinase domain 9 

AA23045 1 S 1 00 calcium binding protein A8 NM_002964 

(calgranulin A) 

AA2682 1 9 macrophage expressed gene 1 No human 

genes 

W41459 Eukaryotic translation initiation NM_001412 
factor lA 

AA003549 Homolog of human flp-3 NM_003011 



NP_001765 X14046 

NP^003346 AF096289 

NP_000197 D11086 
NP 037361 

NP_001817 X54941 

NP_037409 AL136794 

NP_005845 AJ001015 

AAA5999587 85% homologous 

NP_003990 U60805 

NP 004070 AL3562923C002642, 

BQ006623,M90696 

NP^003807 U41766 

NP 002955 A12027,Y00278 



NP_001403 L18960 
NP 003002 M93651 
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Mouse # Name (mouse) 



Human 
mRNA# 



Human 
Protein # 



AA063753 ATP-binding cassette, sub-family 

A (ABCl), member 1 
AI594919 Intersectin (SH3 domain protein 

lA) 

AI385509 Nuclear factor of k light 

polypeptide enhancer p49/pl00 
AA087193 LipocalinZ 
AA 1 75094 Myristoylated alanine rich protem 

kinase C substrate 
AI45 1 276 SH3 domain protein 3 
AA209640 Histocompatibility 2, complement 

component factor B 
AA276440 Selenoprotein P. plasma, I 
AA432934 Neuiopilin 
AA499926 Peptidylprolyl isomerase A 
AA538499 Phosphatidylinositol-4-phosphate 

5-kinase, type 11, a 
AA414612 Capping protein al 
W42321 Pentaxin related gene 
AA162537 Type II transmembrane protein 

MDL-1 
AI646186 Schlafen4 
AA462202 BP-3 alloantigen 
AI322278 Pyruvate dehydrogenase kinase 4 
W9824 1 proproteui convertase 
subtilisin/kexin type 5 
AA172456 small inducible cytokine A12 
AA178155 small inducible cytokine A4 
AA2678 1 1 lymphocyte cytosolic protein 2 
AA 144482 chemokine (C-C) receptor 5 
AA266002 B-cell leukemia/lymphoma 3 

Mouse # Name (mouse) 



NM_005502 

NMJ03024 

NMJ02502 

NM_005564 
NM_002356 

NM_012383 

NM^oonio 

NM„005410 
NM 003873 
NMI021130 
NM_.005O28 

NM_006135 
NM_002852 
NM^O 13252 

NM_018042 
NM_004334 
NM_002612 
NM^006200 

NM 006273 
NmI002984 
NM_005565 
NM_000579 
NM 005178 



Human 
mRNA# 



NP_005493 

NP_003015 

NP_002493 

NP_005555 
NP,002347 

NP 036515 
NPJ01701 

NP„005401 
NP 003864 
NP"'066953 
NpI005019 

NP^006126 
NP.002843 
NP_037384 

NP 060512 
NPIo04325 
NP 002603 
NpI006191 

NP_.006264 
NP_002975 
NP_005556 
NP_000570 
NP 005169 



Human 
Protein # 



Genbank Source 
AF165281.AF27594S 



AF064244 

X61498 

X99133 
D10522 

BC007459 
BC004143JL15702 

Z11793 
AF018956 
X52851 
BC018034 

U56637 
X63613 
AJ271684 

41% homologous 
D21878 
U54617 
BC012064 

X71087 

J04130 
U20158 . 
AH005786 
M31732 



Genbank Source 



AA064293 cartilage oligomeric matrix 
protein 

AI552 105 alkaline phosphatase 2, liver 

AA145458 fibronectin 1 

AA 1 772 1 8 IG a chain C region 

W63981 fibromodulin 

AA030995 peptidylprolyl isomerase B 

AA241281 aquaporin 1 

AA260949 growtli arrest and DNA-damage- 

inducible, y 
AA5 1 8 1 65 tissue inhibitor of 

metalloproteinase 2 
W33786 procollagen, type VI, a 1 
AA000107 procollagen, type XI, a 2 
AI323 1 3 1 thrombospondin 3 
AA073904 dickkopf homolog 3 (Xenopus 

laevis) 

A A ! 09900 hemoglobin a, adult chain I 
AA537 1 1 6 immunoglobulin superfamily 

with leucine-rich repeats 
AI327504 eukaryotic translation elongation 

factor 1 a 2 



NM 000095 NP 000086 L32137 



NM 000478 
NM_002026 
NM_001810 
NM_002023 
NM 012117 
NMIo00385 
NM 006705 



NM 001848 
NMIo80679 
NM 007112 
NM""013253 



NM^005545 
NM 001958 



NP_000469 
NP_002017 
NP^OOlSOl 
NP_002014 
NP 036249 
NPIo00376 
NP_006696 

A37128 

NP^001839 
NpI542410 
NP 009043 
NPI037385 

P01922 
NP 005536 



AB011406,AH005272 

M15801,X02761 

AL109804 

U05291,X75546 

S62077 

M77829,U41517 
AF079806^265659 

100% homologous 

M20776;X15879, 
AH006115.U32169 
L38969 
AF177396 

85% homologous 
AB003184 



NP 001949 X70940 
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Cluster E; 

Mouse # Name (mouse) 



Human 
mIlNA# 



Human 
Protein # 



Genbank Source 



W13151 thymus cell antigen 1, t 

W54287 Biglycan 

W89883 procoUagen. type m, a 1 



NM__006288 

NM^oonii 

NM 000090 



NP_006279 
NP 001702 
NP 000081 



AA175226 complement component 1, r 

subcomponent 
AA242 149 FK506 binding protein 7 (23 kDa) 
AA209006 complement component 1, s 

subcomponent 
AA270625 tenascin C 
AA538511 histocompatibility 2, L region 
W 1 0072 insulin-like growth factor 1 
W 14393 Sid394p 
W 11571 hexokinase I 
W14289 cathepsin Z 
W16254 tubulin, p 5 
W17813 Talin 

W82677 bone morphogenetic protein 1 
W83904 peptidylprolyl isomerase C 
W89354 procollagen-lysine, 2-oxoglutarate 

5-dioxygenase 3 
W99856 procollagen, type V, a 1 
AA002439 annexin A5 
AA030294 Mus musculus frizzled- 1 
AA030780 peroxisomal 83, 5-2-enoyl- 

Coenzyme A isomerase 
AA038395 Ras suppressor protein 1 
AA060268 phospholipase D3 
AA067258 Calumenin 
AA 1 1 0872 amyloid P (A4) precursor protein 
AAl 18715 CD97 antigen 
AA122791 histocompatibility 2, Q region 

locus 7 

AA22220 1 butyrate response factor 1 

AA2426 1 1 foUistatin-like 

AA397 1 14 annexin A4 

AA259366 Trp4-associated protein TAP 1 

AA25955 1 eukaryotic translation elongation 

factor 1 a 1 
AA271275 metallocarboxypeptidase CPX-1 
AA260248 growth factor receptor bound 

protein 10 
AA437882 ribosomal protein L9 
AA396298 RNAse4 
AA474964 Lactotransferrin 
AA499296 annexin AS 
AA547428 protein kinase, cAMP dependent, 

catalytic, P 
W18828 dihydropyrimidinase-like 3 
AA0489 1 5 guanine nucleotide binding 

protein, P-2, related sequence 1 
W14837 protease, cysteine, 1 
Wl 8376 golgi vesicular membrane 

trafficking protein pi 8 
W80 1 77 matrix metalloproteinase 2 
AA003452 thrombospondin 4 



NM 001733 NP_001724 



AL161958 

AH002674.BC002416 

AI755052^I26939,X144 

20 

X04701 



NM^017946 


NP 060416 






P09871 


75% homologous 




MP n^^08^ 


AF078035,AJ006776 




S48134 


69% homologous 


NM 000618 


NP 000609 


X57025 


NM'^ooesis 


NP 006806 


BC 025957 


NM 022361 


NP 071756 


BC022323 


NM 001336 


NP 001327 


AF136273 


NM_001069 


NP_001060 


X79535 


iNIVL UU0Z07 


>JP 006280 




NM 001199 


NP 001190 


M22488 


NM 000943 


NP 000934 


BC002678 


NM_001084 


NP„001075 


BC011674 


NM_000093 


NP_.000084 


D90279,L38808,M76729 


NM 001154 


NP 001145 


AH0049 14,103745 


NM 003505 


NP 003496 


AB017363 


NM_006117 


NP_006108 


AF153612 


NM 012425 


NP 036557 


L12535 


NM 012268 


NP 036400 


U60644 


NM 001219 


NP 001210 


AF013759,U67280 


NM_000484 


NP_000475 


AH005295 


NM_001784 


NP 001775 


X84700 




137519 


68% homologous 


NM 005141 


NP 005132 


J00129,M64983 


NM 007085 


NP 009016 


BC000055 


NM 001153 


NP 001144 


D78152,M82809 


NM 015638 


NP 056453 


BC013144 


NM 006452 


NP 006443 


BC010273 



NM^019609 
NM_005311 

NM 000661 
NM"002937 
NmI002343 
NM^001155 
NM_002731 

NM_001387 
NM_003922 

NM_005606 
NM_005868 

NM_004530 
NM 003248 



NP_062555 
NP_005302 

NP__000652 
NP 002928 
NpI002334 
NP 001146 
NpI0O2722 

NP_001378 
NP„003913 

NP_005597 
NP_005859 

NP_004521 
NP 003239 



D86962 

BG829769,U09953 

BC015520 

X53961 

103578^7673 

M34181 

D78014 
U50078 

BC003061 
AF007551 

AH002654 
Z19585 
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AA073604 procollagen, type 1, a 1 
A A 1 24340 transforming growth factor, P 
receptor HI 

AA24 1 784 insulin-like growth factor binding 

protein 5 
AA268082 Lumican 
AA260280 procollagen, type III, a 1 



W13698 FK506 binding protein 9 
W141 13 twist gene homolog, (Drosophila) 
AA052081 Atpase, class I, type 8B, member 
2 

W89518 annexinA2 
AI894006 procollagen, type XI, a 1 

AA002481 integrin p 5 
AA023549 procollagen, type V, a 2 

AA033050 serine protease inhibitor 4 

AA037995 microfibrillar associated protein 5 

AA059524 procollagen, type VI. a 3 

AA066921 integral membrane protein 2 

AA108363 ribosomal protein L3 

AA108928 secreted phosphoprotein 1 

AA220699 transcobalamin 2 

AA272097 fibroblast growth factor receptor 
1 

AA45 1495 protocadherin 1 3 

AA509765 Endomucin 

AA542013 fibroblast growth factor receptor 
1 

AA047991 keratin complex 2, basic, gene 1 

W17771 catlielin-like protein 
AA221044 histocompatibility 2, L region 
AI322868 myristoylated alanine rich protein 

kinase C substrate 
AA024088 SH3 domain protein 3 
Wl 8121 histocompatibility 2, complement 

component factor B 
AA266975 cell division cycle 42 homolog (S. 

cerevisiae) 

AA172527 ATP-binding cassette, sub-family 

G, member 1 
AA175651 caspasell 
AA260476 calpain 6 
W98807 FXYD domain-containing ion 

transport regulator 5 
AA068750 stromal cell derived factor 1 
AA109951 p-2 microglobulin 
AA200339 secretory leukocyte protease 

inhibitor 

AA245698 regulator of G-protein signaling 5 
AA268592 transforming growth factor, P 

induced. 68 kDa 
AA272807 histocompatibility 2, class 11 

antigen A, a 
W10023 catenin P 
W12260 surfeit gene 4 



NM^000088 
No human 

genes 

NM_000599 

NM_002345 
NM_000090 

NM_007270 
NM_000474 
No human 
genes 

NM_004039 
NM_001854 

NM_002213 
NM_000393 

NM„006216 
NM_003480 
NM_004369 
NmI004867 
NM_000967 
NM_000582 
NM_000355 
NM_000604 

NM 003735 
NmI016241 
NM 000604 



NP 000079 Z74615 



NP_000590 

NP 002336 
NPIo00081 

NP 009201 
NP 000465 



NP_004030 
NP_001845 

NP_002204 
NP_000384 

NP_006207 
NP_003471 
NP_004360 
NP_.004858 
NP 000958 
NPIo00573 
NP_000346 
NP 000595 

NP_003726 
NP^057325 
NP 000595 



BC007038 

AI755052,M26939,X144 
20 

BC011872 
U80998.X99268 



D00017 

AU118365,J04177,U121 

39 

BC006541 

BC015705>I58529.Y146 
90 

BC015663 
AH007047 
X52022 
AF038953 

BC008492,BC012146 
AF052124 
AF047576,M60396 
M34641;K66945 



NNL001004 NP_00O995 

NM_004345 

NM_002356 

NM„012383 
NM_001710 

NM_00179l 

NM_004915 

NM_004347 
NM_014289 
NM_014164 

NM_000609 
NM_004048 
NM_003064 

NM_003617 
NM 000358 



M34641,X66945 

BC0053543C0059203 
C007573 
NP_004336 Z38026 
S48 134 69% homologous 

NP_002347 D10522 

NP_036515 BC007459 
NP_001701 BC004143,L15702 

NP__001782 AL121735.BC003682.M 

57298 
NPJ04906 X91249 

NP_004338 U28015 

NP_055104 AL031117 

NP J54883 AA04421 1 AA296696A 

F161462,BG025158 

NPJ00600 U16752 

NP_004039 AB021288 

NPJ03055 M74444,X04470 

NP_003608 AB008109 
NP 000349 M77349 



NM_002122 NP_002113 L34083j:46875,M20431 



NM 001904 
NM 017503 



NP 001895 
NP"059973 



X87838 

BC0144113M789997 



-20- 



wo 03/072827 



PCT/USp2/35433 



W14138 kallikrein 3, plasma 



No human 



W 1 4540 histocompatibility 2, K region 
W34612 transglutaminase 2. C polypeptide 
W64075 proline rich protein expressed in 
brain 

W81878 osteoblast specific factor 2 
W82 1 4 1 lysosomal membrane 

glycoprotein I 
W82946 benzodiazepine receptor, 

peripheral 

AAl 19072 ceroid-lipofuscinosis, neuronal 2 
AA123008 membrane bound C2 domain 

containing protein 
AA137942 immunoglobulin J chain 

precursor 

AAl 72867 purine-nucleoside phosphorylase 
AA178779 interferon concensus sequence 
binding protein 

AA185869 p-1.4N- 

actylgalactosaminyltransferase 

AA209884 guanine nucleotide binding 
protein (G protein), y 10 

AA24 1 1 32 coatomer protein complex, 
subunit y 1 

AA230649 histocompatibility 2, class n. 

locus Dma 
AA260654 TG interacting factor 
AA268148 eukaryotic translation elongation 

factor 1-P homolog 
AA396152 CD44 antigen 
AA271576 receptor {TNFRSF)-iiiteracting 

serine-threonine kinase 1 
AA276030 ATPase-like vacuolar proton 

channel 

AA4 14089 heterogeneous nuclear 

ribonucleoprotein D-like 
AA413831 plOO co-activator 
AA060205 butyrate response factor 1 
AA200393 CTP synthetase homolog 
AA416325 ATXl (antioxidant protein 1) 

homolog 1 (yeast) 
AA 1 98703 apoptotic protease activating 

factor 1 
AA457927 polypeptide N- 

acetylgalactosaminyltransferase ] 



genes 

NM 004613 
NM_014764 


P18462 
NP 004604 
NP_055579 


68% homologous 

M55153 

D31767 


NM_006475 
NM^005561 


NP 006466 
NP_005552 


D13666 
J04182 


NM„000714 

NM 000391 
NM 015292 


NP_000705 

NP 000382 
NP_056107 


AH000829,M36035,U12 
421 

AF039704 
BC004998 




P01591 


77% homologous 


NM 000270 
NM_002163 


NP 000261 
NP_002154 


X00737 
M91196 


NM_001478 


NP_001469 


L76079JM83651 


NM_004125 


NP_004116 


BCX)15391 


NM_016128 


NP^057212 


AF100756 


NM_006120 


NP_006111 


X62744 


NM 003244 
NM_007086 


NP 003235 
NP_009017 


X89750 
AJ006266 


NM 000610 
NMIo03804 


NP 000601 
NP_003795 


AJ251595 
U50062 


NM_001694 


NP_.001685 


BC009290,BI548787 


NM_014740 


NP_055555 


D21853 


NM 014390 
NM 005141 
NM 019857 
NM_004045 


NP 055205 
NP 005132 
NP 062831 
NPIo04036 


J00129,M64983 

AK024070 

U70660 


NM_001160 


NP_001151 


AF013263 


NM 020474 


NP_065207 


U41514.Y10343 
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Table 3-. Charactenstics of n^stftrs A through E 



Cluster 


Early 


Late 


A 






B 




T 


C 


T 




D 


T 


TT 


E 


T 


T 



4« gene 

expression increased more than 2 fold. 

Example 3 

rnnfirmatinr. o f mWnarrftv data hv BT-PCR and Tn sifa Hvbridizatiwi 
[0072] Confirmation of fee mictoarray data was performed by measuring the 
expression level of genes in two individual paws at each time point using real time RT-PCR and in 
situ hybridization. 

[0073] Real time reverse transcription (RT) PCR analysis was performed as follows: 
to remove possible genomic DNA contamination, total paw RNA was treated with amplification 
grade DNAse I (Gibco Life Technologies. RockviUe, MD). RNA was then subjected to reverse 
transcription using SUPERSCRIPT Preamplification System for First Strand cDNA Synthesis 
(Gibco Life Technologies). Serial dilutions of the oDNA template were prepared and PCR was 
carried out using a Lightcycler System (Roche Molecular Biochemicals, Palo Alto. CA). After 
each elongation phase, the fluorescence of SYBR Green I. which binds double-stranded DNA was 
measured. Reactions (20 ^1) were performed in microcapillary tubes using 5 \il of diluted cDNA 
with SYBR Green I (Roche Molecular Biochemicals), master mix, upstream and downstream 
primers and MgOj. Sequences of primer pairs were as follows: FoUistatin-like, upstream: 5'-GGA 
TTG AGA ATC AGC ACT GGG-3' (SEQ ID NO:386); downstream: 5'-TTG AAA GGG AGG 
GCA CAG AAC-3' (SEQ ID NO:387); IL-2Ra, upstream: 5'-CGG AAG COT GAA CAT CAA 
TCC-3' (SEQ ID NO:388); downstream: 5'-GCC ACT AAC CCC AAC TCT TAT GAG-3' (SEQ 
ID NO:389); GAPDH, upstream: 5'-ACC ACA GTC CAT GCC ATC AC-3' (SEQ ID NO:390); 
downstream: 5'-TCC ACC ACC CTG TTG CTG TA-3' (SEQ ID NO:391). Reactions containing 
water or cDNA synthesized without reverse transcriptase, as template, resulted in no PCR products. 
Dilutions of cDNA synthesized from early paw RNA were predicted to have the highest expression 
of the gene product being amplified and, thus, were used as the concentration standards. 
Lightcycler quantification software v3 was used to compare amplification in experimental samples 
during the log-linear phase to the standard curve from the dilution series of acute tissue. AU 
experimental samples were normalized to GAPDH (glyceraldehyde-3-phosphate dehydrogenase) 
expression levels for that tissue. Expression levels of each gene were plotted relative to flie levels 
in normal tissue. 
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[00741 In situ hybridization analysis was performed as previously described (Witte, et 
al. Am J. Pathol 1991;139:717-724). Briefly, ten micron cryostat sections of snap frozen tissue 
were air dried on TESPA coated Superfrost Plus (Histology Control Systems. Glenhead, New 
York) slides and post-fixed in 4% (w/v) parafomaldehyde in PBS then acetylated witli acetic 
anhydride as described. Paws were fixed for 48 hours in 4% (w/v) paraformaldehyde (Electron 
Microscopy Sciences, Ft. Washington, PA) in PBS at 4°C immediately after harvesting. Following 
fixation, the tissue was decalcified in TBD-2 (Shandon. Pittsburgh, PA). Complete decalcification 
of the tissue was determined using 5% ammonium oxalate. Following decalcification the tissue 
was rinsed for ten minutes in running water and placed in 30% sucrose in PBS for 24 hours at 4»C. 
The samples were embedded in M-1 mounting media (Shandon), frozen in liquid nitrogen and 
stored at -80°C. Hybridizations were done overnight at 45°C under a sealed coverslip. Following 
hybridization, the sections were treated with RNAse to remove unbound probe and the slides were 
washed extensively under highly stringent conditions. The slides were developed in Kodak D19 
developer (Rochester, NY). Sections were counterstained with hematoxylin & eosin and 
photographed using both dark- and bright-field illumination. 

[0075] Mouse sense and antisense RNA probes were synthesized using the KNA 
Transcription Kit (Stratagene. La JoUa, OA). T3 or T7 RNA polymerase produced "S- radiolabeled 
antisense or sense single-stranded RNA probes, respectively. A sense probe generated from an 
unrelated mouse gene was used as a negative control for in situ hybridization. 

[00761 Alftough none of the genes previously demonstrated to be uptegulated by RPA 
were present on the microarray chip, two genes on the DNA microarray were related to genes 
whose expression patterns we have previously analyzed by RPA. One of the genes, IL-2Ry, had a 
similar expression pattern to the previously observed expression pattern of IL-2. Ano&er gene, 
foUistatin-likc, which is induced by TGFp, had a similar expression pattern to the jHcviously 
observed expression patterns of TGFp 1, 2 and 3. Comparison of the expression of foUistatin-like 
and IL-2RY by microarray and real time RT-PCR revealed similar patterns of expression (Figure 2). 
In addition, spatial expression of IL-2Ry was analyzed by in situ hybridization CFigure 3). The 
expression pattern matched that observed in the DNA microarray hybridizations. IL-2Ry was 
expressed in the inflammatoiy tissue surrounding the joint and in the periosteal tissue along the 
length of the bone. 



Example 4 

niafgrification of difFerentiallv ^pressed genes 
(00771 Of the 385 genes that were found to be diffwentially expressed during CIA in 
the mouse paw, 102 were expressed sequence tags (ESTs) and preferred members of this group 
represent novel genes critical to the pathology of CIA. Excluding diq)licate gene spotting on the 
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Chip, 240 of the 385 gene sequences are annotated genes. Information on their expression in 
various tissues was obtained using LocusLink and Unigene at the National Center for 
Biotechnology Information (NCBI) website (ncbi.nlm.nih.gov/). ITiese genes have been reported in 
a variety of tissues, including but not limited to bone, brain, colon, liver, lung, kidney, mammary, 
skin, spleen and testis. Not surprisingly, the majority ai-e expressed in the lymphoid organs, 
including spleen and lymph nodes (Figure 4). 

[0078] To further characterize the annotated genes, they were grouped into categories 
using Incyte's Function and Pathways categorization (Figure 5). The largest fimctional categories 
included immunity and defense (47 genes), protein metabolism (36 genes), lipid metabolism (11 
genes) and differentiation and prohferation (11 genes). The largest pathways categories included 
membrane (59 genes), secreted and extracellular (59 genes), organefle (24 genes), intracellular 
signaling (17 genes), receptors (17 genes), proteases (15 genes) and antigen recognition (14 genes). 
In most cases, the genes in each category were distributed proportionally to the size of the clusters 
identified in Figure 1. 

[00791 The 240 previously characterized genes that were differentiany regulated 
during CIA were analyzed through extensive Uterature searches. Of these 240 genes, a number of 
genes that have not previously been characterized in autoimmune arthritis but that could potentially 
be involved, were identified. From the literature searches on these particular genes, a number of 
genes were found to be associated with three basic biological functions. Tliese genes, as well as 
flieir temporal expression, are listed in Table 4. 



Table 4 Genes novel to artliritis 


Early 


Late 


Proliferation, differentiation and tumorigenesis 


enolase3, (J muscle 






tumor-associated calcium signal transducer 2 






SI 00 calcium binding protein A3 




* 


angiopoietin related protein 2 




* 


P-1 ,4 N acetylgalactosaminyltransferase 


* 


* 


polypeptide N-acetylgalactosaminyltransferase 1 


* 




endomucin 


* 




growth factor receptor bound protein 10 


* 


* 


growth arrest and DNA-damage-inducible, y 






dickkopf homolog 3 (Xenopus laevis) 


* 


♦ 


CDC28 protein kinase 1 






a disintegrin and metalloproteinase domain 9 


* 




ecotropic viral integration site 2 






selenoprotein P 


* 




proprotein convertase subtilisin/kexin type 5 


* 




B-cell leukemia/lymphoma 3 


* 
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Ap optosis 

apoptotic protease activating factor 1 

regulator of G-protein signaling 5 

calumenin 

CD97 

calpain 6 

caspase 1 1 

receptor interacting protein 
transglutaminase 2, C polypeptide 
CD44 
CD53 

fibrinogen/angiopoietin-related protein 
baculoviral lAP repeat-containing 2 
uncoupling protein 2, mitochondrial 



Inflammation 

annexin A2 
annexin A4 
annexin A6 

lysosomal membrane glycoprotein 1 
protocadherin 13 
catenin beta 
pentaxin related gene 
small proline-rich protein 2A 
small inducible cytokine subfamily B (Cys-X-Cys) 
colony stimulating factor 2 receptor, p 2, low-afEnity 
CD37 

type II transmembrane protein 
BP-3 alloantigen 

Mus musculus h ypoxia induced gene 2 (Hig2) 

[0080] The present study quantitatively analyzed coordinated gene expression on a 
global scale from paws of mice with CIA to identify novel genes involved in arthritis as well as to 
identify gene expression patterns that differ between early and late synovitis in this model system. 
Genes known to be upregulated in CIA or RA were confinned by the analysis. However, most of 
the differentially-expressed genes identified by the microanray have not been previously described 
in arthritis. 

[0081] The difference in expression profiles observed between early and late disease 
has not previously been fully-appreciated. Even though the microarray analysis was limited to two 
time points over the course of the disease, cluster analysis grouped the 385 genes according to their 
mRNA expression in early versus late disease. In some embodiments, the hierarchical clusters can 
represent coordinately expressed genes, the effects of cell phenotype and/or a combination of the 
two. Confirmation of the validity of the microarray expression analysis includes RT-PCR analysis 
of expression of follistatin-like gene and IL-2Ry, as well as analysis of the spatial expression of IL- 
2Ry by in situ hybridization. Of 385 genes on the microarray found to be differentially expressed in 
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CIA. 240 have been previously annotated. These 240 genes can be divided into several biological 
functions and pathways; however, none of the clusters were over-represented in any of these 
categories. 

10082] Included in the group of annotated genes are many that have previously been 
demonstrated to be upregulated in RA, including TIMP-3. p-2 microglobulin, biglycan, lumican. 
insulin-Uke growth factor binding protein 5 and stromal cell derived factor-1, as well as 
proinflammatory genes such as IL-2Ry, small inducible cytokine A12 and A4 (MCP5 and MIPlp 
respectively), CCR5, macrophage expressed gene 1, cathepsins C and S. CD14 and fibronectin. 
Expression of a majority of these 240 genes also occurs in lymphoid organs, which is expected 
since the synovial inflammation is dominated by immune cells. 

[0083] The 240 annotated genes were analyzed through extensive review of flie 
literature, resulting in a list of 43 genes not previously characterized in autoimmune arthritis. Based 
on their known biological functions these genes might play central roles in the padiophysiology of 
the disease. These genes, as well as their temporal expression, are listed in Table 4. Several 
interesting comparisons can be made between the biological fianction of these genes, their temporal 
expression patterns, and the histopathologic appearance of arthritis. 

Examples 
ftTTprtBised throughout CTA 

[00841 Several genes involved in cell prolifraation. differentiation and tumoigaiesis 
were upregulated throughout the disease (clusters D and E), These included p-1.4 N- 
acetylgalactosaminyltransferase and polypeptide N-acetylgalaotosaminyltransfcrase 1, that aie 
involved in the synthesis of gangUosides. whose overexpression is associated with a marked 
increase in growth rate and invasive activity. 

[0085] Numerous genes involved in apoptosis were identified that were expressed bofli 
in early and late disease. Cellular turnover in normal tissues is tightly regulated through a balance 
of cell proliferation and cell death. The regulation of cell populations within the jotat is very likely 
also controlled by apoptotic processes. Apoptosis of cells within the arthritic joint has been 
proposed to be a source of self-peptides that could generate auto-antigens that may propagate 
inflammation. One of these, CD44, has been postulated to play a role in the elimination of 
neutrophils from sites of inflammation in inflammatory kidney disease and its upregulation on the 
surfece of chondrocytes may contribute to cartilage degeneration in RA patients. Other genes 
include calpain 6 and caspase 1 1, which are members of two famihes of cysteine proteases involved 
in flie regulation of pathological cell death. Additionally, receptor interacting protein (RIP) 
interacts with Fas, causing morphological changes in cells that resemble apoptosis. 



-33- 



wo 03/072827 PCTAJS02/35433 

[0086] Inflammatoiy processes occur both early and late in disease. Therefore, the 
identification of genes involved with inflammation was not unexpected; however, various genes 
were identified that had not previously been associated with inflammation in CIA or RA. Tliese 
genes include annexins A2. A4 and A6. which affect the activation and migration of macrophages. 
The human homologue of lysosomal membrane glycoprotein 1. h-LAMPl. is detectable in patients 
with scleroderma and systemic lupus erythematosus and may contribute to the migration of 
activated leukocytes to the sites of inflammation. Catenin-P. when complexed with E-cadherin. is 
upregulated in gut inflammation of patients with spondyloarthropathy. 

Example 6 
nanftR expressed in late CIA 
[0087] Late CIA is characterized by an increase in fibrosis. Fibroblasts taken fiom RA 
patients with chronic disease are in a constitutive state of activation and exhftit plasticity in ceU 
growth. Of the eight annotated genes that are selectively upregulated in late disease listed in cluster 
B of Table 1 . four are involved in cell proliferation, differentiation and tumorigsnesis and may play 
a role in the chronic activation of fibroblasts at late stages of disease. Specifically, tumor associated 
calcium signal transducer 2 is expressed early in tumorigenesis. and angiopoietin related protein 2 
is associated with endothelial cell development and tumorigenesis. 

Fxample 7 
rtftnes expresseH in «5arlv CIA 
[0088] Several genes involved in cell proliferation, differentiation and tumorigenesis 
are selectively upregulated in early disease and are Usted in cluster C of Table 1. CDC28 kinase 
binds to the catalytic subunit of cyclin dependent kinases and may be associated with dysregulation 
of lymphocyte cell cycle control in HIV infected patients. ADAM9, a disintegrin and 
metalloproteinase domain 9. binds MAD2beta, which is involved in cell cycle control. 

[0089] Three apoptosis genes that are selectively upregulated in early CIA have anti- 
apoptotic properties. These include CD53, fibrinogen/angiopoietin related protein and baculoviral 
lAP repeat containing 2. The latter two are involved in endothelial cell survival. The upregulation 
of genes involved in endothelial cell survival, particularly eariy in disease, may allow for migration 
of inflammatory cells into the diseased joint. 

[00901 Genes selectively upregulated in early arthritis (cluster C) include many 
inflammatory genes previously associated with CIA or RA. In addition, numerous other potentially 
pro-inflammatory genes are in this category. Pentaxin-related gene is involved in inflammatory 
reactions, particulariy those of the vessel wall. Small inducible cytokine B subfemily member 13 
(CXCL13) is a chemokine for B lymphocytes. Type H transmembrane protein is expressed 
exclusively in macrophages and monocytes and is involved in activation of myeloid cells. Hypoxia 
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induced gene 2 (interleukin-20) is modulated by hypoxia and may have a role in inflammation, 
possibly in attempting to re-establish homeostasis. 

Example 8 
fipipq ^ that are dow n-regulated 
[00911 Although most of the differentially-expressed genes were upregulated during 
CIA, all the genes in cluster A of Table 1 were downregulated, compared to normal paws. This 
represents a group of potentially important genes, as their downregulation may contribute to the 
loss of homeostasis in the joint and the failure to limit the inflammatory process. One annotated 
gene in cluster A. cytochrome P450, has previously been shown to be downregulated in 
inflammation and certain alleles of cytochrome P450, which are inactive or poor metabolizers, 
show a modest association with susceptibility to ankylosing spondylitis, but not RA. Most of the 
genes in cluster A are ESTs. and their further characterization wiU be of interest. In addition to the 
25 ESTs in cluster A. the further chaiacterization of the other 132 ESTs identified in this study wttl 
provide information about flie gene regulatory network(s) involved in the autoimmune arthritic 
process. 

[00921 In summary, the present study utilized DNA microarray technology to analyze 
coordinated gene expression in paws of mice with early and late CIA. Ihis analysis has revealed a 
large number of genes previously not known to be involved in arthritis, as well as distinct gene 
expression profiles that differentiate between early and late CIA. Further characterization of these 
genes and pathways will advance the understanding of the basic mechanisms responsible for 
initiation and persistence of synovitis and may aid in the development of novel therapies. 

P.xample9 

T«n1«.tinn ftf fiilMpngt^ g«nes identified bv ESTs 
[00931 The 157 expressed sequence tags (ESTs) are used to identify the fiiU-length 
genes associated with them. The EST sequences are used to search public and proprietary computer 
databases. Those that are not identified in &e databases, are used to screen mouse libraries for fiill- 
length cDNA clones using methods known to one of skill in the art. 

Example 10 

THpnrification of Human Homolog s and production of a human micrparray 
[00941 Human homologs are identified by searching databases to find the closest 
human homolog for each of the 385 mouse genes identified herein. Many of the human homologs 
are known. Those that do not possess a homolog in the databases are identified by screening a 
human cDNA library using a mouse probe. In particular, when active regions or highly conserved 
regions of the mouse protein are known, these are used to screen the libi-ary. For example, kinases 



-35- 



wo 03/072827 PCTAJS02/35433 

are known to contain regions that are highly conserved. Thus, if the mouse gene codes for a kinase, 
these regions are included within the probe. Alternatively, or in addition, a degenerate mouse probe 
is produced, with the degeneracy in regions that are less likely to possess high homology, for 
example, a degenerate probe for a kinase is constructed to have more degeneracy around the kinase 
region. 

Kxample 1 1 

mPNA expression prnfilin p of earl v and late rhenmatoid arthritis in hmnms 
[00951 Differential gene expression in the synovial tissue of humans with rheumatoid 
arthritis was analyzed and compared to that of synovial tissue from normal humans. 

(00961 RNA was isolated from a human synovial biopsy and quick frozen in liquid 
nitrogen for storage at -SCC. Frozen synovial tissue was minced with a scalpel and homogenized 
with a Polytron Tissue Tearor (Biospec Products, BarUesville. OK) in appropriate volumes of RNA 
Stat-60 (Tel-Test, Friendswood, TX). Total RNA was extracted from the tissue homogenates 
according to the manufacturer's instructions. Pooled total RNA from normal synovial biopsy 
samples, mild arthritic synovial biopsy samples and severe arthritic synovial biopsy samples was 
used to isolate polyA+ RNA using &e Oligotex mRNA isolation kit (Qiagen, Valencia. CA) 
according to the manufecturw's instructions. RNA concentrations were measured by fluorometiy 
using the Ribogreen RNA Quantification Kit (Molecular Probes, Inc., Eugene, OR). 

[00971 DNA mirax)array analysis was performed as follows: mRNA from a human 
without RA was used for normalization of ^e expression levels across all microarray chips. 
Competitive hybridizations with Cy3 labeled normal human mRNA versus Cy5 labeled mild RA 
mRNA or Cy5 labeled severe RA mRNA were performed. Each sample (normal, mild and severe) 
was labeled and hybridized to the GeneChip® Human Genome U95 Set from Afifymetrix (Santa 
Qara, CA)which represents about 60,000 full-length genes and EST clusters. 

[00981 Primary data is examined using Incyte Gemtools software and GeneSpring 
version 4.0.4 software (Silicon Genetics, Redwood City, CA). Defective cDNA spots (irregular 
geometry, scratched, or <40% area compared to average) or spot fluorescence hybridizations with 
signal to noise ratios less than 2.5:1 are eliminated from the data set. Data sets are subjected to 
normalization first withm each microarray experiment such that flie median of the Cy5 channel was 
balanced against the ratio of the Cy3 channel (k*(MedianCy3) = MedianCyS. where k is the ratio of 
the median intensities in each). Each microarray contained 192 control genes present as non- 
mammahan single gene "spikes" or "complex targets". The complex targets consist of probe-sets 
that contain a pool of cellular genes expressed in most cell types. In addition, each experimental 
mRNA sample was augmented with incremental amounts of non-mammalian gene RNA (2X, 4X, 
16X, etc) to permit assessment of the dynamic range attained within each microarray. Little 
variation was observed across flie microarray series with respect to the control genes (not shown). 
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providing support for inter-array comparisons of temporally regulated genes. Genes were clustered 
according to their expression pattern by subjecting the log-transformed data (R= log2Cy5/(kCy3), 
where R is the log of the expression ratio for each gene) to the hierarchical tree clustering algorithm 
as implemented in the GeneSpring progi*am (Silicon Genetics). The hierarchical tree analysis was 
perfonned using a minimum distance value of 0.001, separation ratio of 0.5 and the standard 
correlation distance definition. 

[0099] Human sense and antisense RNA probes were synthesized using the RNA 
Transcription Kit (Stratagene, La Jolla, CA). T3 or T7 RNA polymerase produced ^^S- radiolabeled 
antisense or sense single-stranded RNA probes, respectively. A sense probe generated from an 
unrelated human gene was used as a negative control for in situ hybridization. 

[01001 For mild and severe disease, mRNA from patients with severe arthritis (score 
of 4) were used to generate probes that are hybridized to the GeneChip® Human Genome U95 Set 
from Affymetrix (Santa Clara, CA) which represents about 60.000 full-length genes and EST 
clusters, as is mRNA from normal human synovial tissue. Hybridizations are conducted on 
duplicate chips, allowing for the ehmination of genes whose expression levels differed by greater 
than 50% between the duplicate samples. About 60,000 genes and ESTs are represented in the Set. 

[0101] The method above seeks to identify all genes that are differentially expressed in 
human arthritis using a variety of microarrays or DNA chips. Using the information identified in 
Examples 9-11 a •'human Rheumatoid Arthritis genechip" is produced. 

Example 12 

Method for the production of a "hum an Rheumatoid Arthritis genechip" 
[0102] The genes that are found to be differentially expressed in Examples ,9-11 are 

used to produce a "human Rheumatoid Arthritis genechip." This chip will be used for the 

diagnosis, prognosis, and treatment of the disease. 

[0103] Other chips are produced with those differentially expressed genes that are only 

expressed in mild disease, a "mild RA" chip and those that are only differentially expressed in 

severe disease, a "severe RA" chip. 

Bxaryiple 13 
Method for the diagnosis and staging of RA 
[0104] mRNA is isolated from human synovial tissue, blood and human synovial fluid 
and treated as in Example 2. The microarray produced in Example 12 is analyzed for gene 
expression. From the analysis of up-and down-regulated genes a diagnosis and analysis of disease 
is made. The patient is monitored periodically during active disease and/or treatment. A prognosis 
is made based on these results as to the severity and chronic nature of the disease as well as the 
speed of deformity. 



-37- 



wo 03/072827 PCT/US02/35433 

Example 14 

Tf^ft^Pn t >>v j nViihitinp expr '': ?«ir>n r.f up-rePiilated eencs 
[0105] One or more of the genes that are up-regulated in Examples 4-6 are inhibited 
using antisense oligonucleotides or triple helix oligonucleotides. The antisense oligonucleotides are 
produced using methods known to one of skill in the art. The antisense oligonucleotides are 
administered intravenously, intramuscularly, or within a joint and the symptoms and disease is 
monitored. 

P.yample 15 

-^r ^^tm^ pt B A hv activ ^ t i^ ff t^^ressinn of down-rgpukfed RCTeS 
101061 One or more of the genes that are down-regulated in Example 7 are activated 
using known transcriptional activators. Alternatively, expression vectors are administered that are 
targeted to the synovia and express one or more of the genes that are down-regulated. Preferably, 
the expression vectors are retroviral and are administered intravenously. The transcriptional 
activators and vectors are produced using methods known to one of skill in the art. 

F.xample 16 

TTftatmrn r "^'^^ ^ hv adn ifnirfrarinn of dmvn-rMnilated TTOtems 
fOlOTl One or more of the proteins that are down-regulated in Example 7 are purified 
and administered. The proteins are administered intravenously or into the joint 

Rxample 17 

^T;^ ^ fi^ > >^ y^n/.nginpftieti T y-rpi»>t^-H protein to ^hmce. angiogenrfiiff in jovial tisstics and to 

^^ ffinftrtie involvftmRnt in arth ritic processes 
[0108] Because laimers for fibrinogen/angiopoietm-related protein amplified a 270 
base pair product fiom cDNA synthesized from mRNA from synovial tissues of RA patients, this 
suggests that this protein is involved in some way in the pathogenic process. Thus, expression of 
fibrinogen/angiopoietin-related protein is analyzed in various forms of RA and in situ in synovial 
tissue. If over-expression is identified in the process, anti-sense ohgonucleotides are used to inhibit 
expression of fibrinogen/angiopoietin-related protein in synovia or systemically in the RA patients. 

F.xample 18 

netermination of the besf treatment for a patient with RA 
[0109] From the results of the gene expression analysis, flie best treatment for fee 

patients with RA is determined. The treatment is based on flie specific gene expression profile. 

[0110] Thus, synovial fluid from a patient with rheumatoid arthritis is analyzed using a 

microanay as in Example 2. The analysis is used to identily the genes that are specifically up- 
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regulated or dovm-regulated in that patient Then, the treatment is selected based on the specific 
gene expression. 

[01111 Although described in the context of certain preferred embodiments, the skiUed 
artisan will appreciate that various changes and modifications can be made to the preferred 
embodiments, and such changes and modifications are meant to be encompassed by the invention, 
as defined by the appended claims. 

Rxam ple 19 

r^r ^^uHnr, nf mRNA n v^^v pression in CIA with human ffPf flTid fimfftion: FABP 
[0112] Microairay analyses identified fibrinogen/angiopoietin related protein (FARP) 
as one of the most highly over-expressed mRNAs (8734 tested) in arthritic paws of mice with 
collagen-induced arthritis (CIA). See Table 1. Cluster C. Mouse # W13905; 
Fibrinogen/angiopoietin-related protein. Data also demonstrated that human FARP. Data also 
demonstrated that hunianFAia»mRNAi8expressedinrheumatoidaithritis(RA)syno^ FARP 
is highly homologous to angiogenic fectors and inhibits apoptosis of vascular endothelial ceUs in 
vitro. In RA, an increase in blood vessel fonnation, or angiogenesis. is observed in synovial tissue. 
Endothelial cells lining blood vessels can provide nutrients for inflamed tissue, allow infiltration of 
inflammatory cells, and secrete inflammatory cytokines, all of which contribute to disease 
processes. The suppression of arthritis by angiogenic inhibitors in animal models, such as CIA. 
further demonstrates that angiogenesis is necessary for arthritis. Mouse FARP mRNA is highly 
expressed during early stages of CIA and human FARP mRNA is expressed in RA synovial tissue. 

Byamplc20 

rhnmf ^"-'^"g ^^^^ tocnression in QA 
[01131 Prior to the present invention, FARP had not been described in arthritis. 
Localization of the cells that produce FARP mKNA and FOtein within the joint permits analysis 
FARP's role in angiogenesis in CIA. The cell types producing FARP mRNA and protein are 
determined and the role of FARP protein expression as it relates to the mRNA expression during 
CIA is identified. 

[01141 Determination of spatial expression of FARP mRNA during CIA. DBA/1 
mice are immunized with collagen as described in Thornton, et al. (1999) Arthritis Rheum 42:1109- 
1118. Mice are sacrificed 21, 28, 35, 42 and 49 days following primary collagen immunization. In 
situ hybridization analysis of FARP mRNA expression using sense and antisense probes generated 
fi-om the FARP mouse cDNA are performed on tissue sections from paws of normal, unimmunized 

mice and arthritic mice. 

[0115] Generation of antibody to FARP. An anti-FARP antibody is generated as 
described in Kim I, et al, (2000) Biochem J 346:603-610. and used for immunodetection and 
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blocking of FAR? function. Nucleotides 298 to 866 of the cDNA coding for the mouse FARP 
protein are cloned into the mammalian expression vector pcDNA3.1/His, which incorporates a 
histidine tag for easy isolation of the recombinant protein (Invitrogen, Carlsbad, CA). FoUowing 
purification, this protein fragment encoding amino acids 100-289 of mouse FARP is injected into 
rabbits and serum is collected. Polyclonal antibody is purified from rabbit serum by ammonium 
sulfate precipitation and protein A column chromatography as described in Harlow E. et al, (1988) 
Antibodies: A laboratoiy manual. Cold Spring Harbor. NY. Cold Spring Harbor Laboratory; and 
Shanley JD, et al, (1994) J Infect Dis 169:1088-1091. 

[01 16] Determination of spatial and temporal expression of FARP protein during 
CIA. Since protein levels do not always directly reflect mKNA levels of a gene, Ihe protein 
expression of FARP is determined in arthritic CIA paws using the anti-mouse FARP polyclonal 
antibody generated above. FARP protein is localized immunohistochemically using a horseradish 
peroxidase conjugated anti-rabbit secondary antibody. Sections are processed from paws of non- 
immunized mice and from paws of mice sacrificed 21. 28, 35, 42 and 49 days following primary 
collagen injection. Sera from non-immunized rabbits are used as a negative control. Sections from 
mouse liver are used as a positive control for inununoWstochemical staining. 

[0117] Results. /B^/tomRNA analysis demonstrates expression of FARP mRNA in 
the inflamed area of arthritic paws. FARP mRNA and protein are seen to be more highly expressed 
early in disease. In some embodiments, FARP protein is localized to the vasculature in arthritic 
paws. Blood vessel formation in OA paws is readily observed by standard hematoxylin and eosin 
staining. However, co-localization of vasculature and FARP expression is demonstrated by 
analysis of serial sections for expression of endothelial cell-specific markers, such as von 
WUlebrand fector Lu J, et al, (2000) J Immunol 164:5922-5927, in conjunction with FARP 
expression. The anti-human FARP polyclonal Ab from Kim, et. al. will be obtained, as this 
antibody will likely ciossreact with mouse FARP. The homologous portion of mouse FARP 
protein that was previously used by Kim, et. al., is used to generate anti-human FARP polyclonal 
antibodies. Successful use of this polyclonal antibody in immvmohistochemical staining 
demonstiates that administration of this portion of the protein to rabbits can generate polyclonal 
antibody to FARP. Polyclonal antibodies are easier and faster to generate than monoclonal 
antibodies; in some embodiments, the use of an antibody to block FARP fimction involves 
generation of a monoclonal antibody. 

Example 21 

Determining the anti-apoptotic effects of FARP on endothelial cell s 
[01181 The angiogenic protein Angl and FARP have anti-apoptotic eflfects on 
endothelial cells. Angl mediates its anti-apoptotic effects by activating Tie2, an endothelial cell- 
specific receptor, resulting in phosphorylation of the serine-threonine kinase, Akt (protein kinase B) 
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and mRNA upregulation of the apoptosis inhibitor, survivin. Papapetropoulos A, et al, (2000) / 
Biol Chem 275:9102-9105. FARP does not bind Tie2, but is highly homologous to Angl and is a 
secreted protein with anti-apoptotic effects on endothelial cells. FARP also has anti-apoptotic 
effects specific for endofheUal cells, and is a secreted protein. Activation by FAKP of an 
endothehal cell-specific receptor is found to result in the phosphorylation of specific anti-apoptotic 
intracellular molecules and increases mRNA expression of anti-apoptotic fectors. Determination of 
the pathway that FARP utilizes in prolonging endothelial cell survival provides potential targets for 
therapeutic intervention. The effects of FARP on anti-apoptotic factors potentially regulating 
endothehal cell survival is identified. 

10H91 In preferred embodiments, treatments and drug candidates that interfere with 
receptor binding by FARP lead to deactivation of the anti-apoptotic serine-threonine kinase, Akt, in 
endothelial cells. In further preferred embodiments, interference with expression of FARP, normal 
fimction of its receptor, and/or binding of FARP to its receptor also leads to decreased expression of 
survivin. Bcl2, and other anti-apoptotic factors in endothelial cells. Overall, these effects result in 
enhanced or normalized apoptosis of vascular endothelial cells m tlie arthritic joint, leading to a 
diminution or reversal of disease symptoms. 

[0120] Expression and purification of recombinant mouse FARP (rmFARP). The 
entire cDNA coding for mouse FARP is inserted into the mammalian expression vector 
pcDNA3.1/His, which contains a six amino acid histidine tag for easy isolation of the protein 
(Invitrogen). The cDNA is transfected into COS-7 cells and purified firom the cell supernatant The 
anti-mouse FARP polyclonal antibody discussed above is used in Western blots to determine 
whether rmFARP is expressed in COS-7 cells. 

(0121 J Effects of FARP on endothelial cell expression of anti-apoptotic molecules. 
HUVEC (ATCC, Rockville. Maryland) is treated with rmFARP in a range of 50 to 500 ng/ml as 
described for Angl (Papapetropoulos A, et al, (2000) J Biol Chem 275:9102-9105) and or with 
vehicle. RNA fi^m these cells is analyzed by RNase protection assays (BD Pharmingen, San 
Diego, CA) for expression of the anti-apoptotic genes, survivin and Bcl-2, as previously performed 
in Thornton S, et al, (1999) Arthritis Rheum 42:1 109-1118. 

[0122] Effects of rmFARP administration on phosphorylation of serine-threonine 
kinases important in cell survival. Phosphorylation of the Akt survival serine threonine kinase is 
assessed as described in Papapetropoulos A, et al. Microvascular endothelial cells (Vec 
Technologies, Rensselaer, NY) are treated witli and without rmFARP. Anti-Alct antibody (Santa 
Cruz Biotechnology, Inc., Santa Cruz, CA) and phosphospecific Akt antibody (New England 
Biolabs, Beverly, MA) are used in Western blots to determine the amount of Akt protein present 
and the extent of Alct phosphoiylation in these cells. 

[0123] Results. rmFARP is found to increase the expression of survivin or Bcl-2 in 
endothelial cells, and also increases the phosphorylation of Akt. FARP is found to utilize a separate 
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signaling pathway from Angl, and other signaling molecules are thus analyzed for their role in the 
anti-apoptotic effects mediated by FARP. Additionally the anti-apoptotic molecules XIAP, C-IAP2 
and MAP are analyzed at the same time as survivin and Bcl-2 in the RNase protection analysis. 
These studies elucidate FARP's downstream effects that are mediated by a receptor. 

Example 22 
Detemiining th e rnle of FARP dllring CIA 

[0124] Since FARP is one of the most highly overexpressed genes in CIA, and since it 
is also expressed in rheumatoid arthritis synovial tissue, its role in arthritis is tested both by 
administration and depletion of FARP before disease onset and during disease progression. In 
some embodiments, FARP aids in endothelial cell survival, allowing for increased inflammation in 
CIA. Thus, treatment with FARP can exacerbate CIA, and depletion of FARP can inhibit CIA. 
Recombinant mouse FARP, as well as anhTjodies to FARP, are administered before and during 
disease. 

[01251 Effects of administration of rmFAKP on the development and severity of 
CIA. rmFARP is administered i.p. to DBA/1 mice immunized with collagen. Based on puWished 
studies with other molecules (Thornton S, et al, (2000) J Immunol 165:1557-1563), FARP (10 
ug/0.5 mymouse) is administered twice daily from days 14 to 21 following primary collagen 
immunization for testing effects before disease onset. For established disease, FARP is 
administered twice daily for seven days starting 24 hours after disease onset. Mice are scored daily 
for macroscopic signs of arthritis as described m Thornton S, et al, (1999) Arthritis RJieum 42:1109- 
1118. Mice are sacrificed at day 49 of disease and sections from treated and untreated mouse paws 
are analyzed histochemically for blood vessel formation and inflammatory cell infiltration by 
hematoxylin and eosin staining. 

[0126J ElTects of depletion of FARP on endothelial cell apoptosis. Antibody 
produced as described herein is used. Assessment of the ability of anti-FARP antibody to block the 
anti-apoptotic effects of FARP is performed in vitro with endothehal cell lines as described in Kim 
I, et al, (2000) Biochem J 346 Pt 3:603-610. Induction of apoptosis in HUVEC cells is performed 
by serum deprivation. HUVEC cells are grown for 24 hours in the presence of 10% serum and then 
incubated for 24 hours with the same media, or serum-free media with control buffer, rmFARP 
(200 and 800 ng/ml) or rmFARP plus anti-FARP antibody at varying concentrations. Analysis of 
apoptotic cells is as described in Kim, et al. Sera from unimmunized rabbits is used as a negative 
control. 

[0127] Effects of depletion of FARP on the development and severity of CIA. 

Anti-FARP antibody is administei*ed similarly to studies using anti-VEGF antibody in CIA (Sone 
H, et al, (2001) Biochem Biophys Res Comfnun 281:562-568). Antibody is delivered i.p. (200 
ug/0.2 ml/mouse) every other day for 8 days both before (days 14-22) and during disease (24 hours 
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after onset) as desciibed above. Normal rabbit immunoglobulin and PBS are used as negative 
controls. Mice immunized with collagen are analyzed macroscopically and histologically as 
described above. 

[0128] Results. It is found that administration of FARP protein to mice before disease 
onset can hasten the onset of disease, and that administration after disease onset can exacerbate 
disease symptoms and increase vasculature in the inflamed paws. Thus, in preferred enobodiments, 
FARP is deleted by antibody. Li alternative embodiments, a FARP knockout in DBA/1 mice is 
generated. Additionally, since FARP mRNA is synthesized in the rat embryo, it is implicated in 
embryonic development. In preferred embodiments, the antibody produced as described herein can 
block or interfere with the function of FARP. A polyclonal antibody produced in rabbits is 
optimized by using an affinity colunm made of tiie recombinant protein to purify the antibody. An 
alternative approach is to generate a monoclonal antibody. An advantage of using anti-FARP 
antibodies is the benefit of an antibody as a therapeutic agent. 

Example 23 
Involvement of FARP in aneiogenesis 
[01291 FARP mRNA and protein are localized to the vascular endothelium in arthritic 
paws of CIA mice. Study of protein levels in such mice indicates that FARP protein levels 
correlate with FARP mRNA levels. Cells expressing FARP. mRNA and protein during CIA are 
identified, and the kinetics of expression of FARP protein during CIA permits design of therapies 
and testing of candidate drugs having a specific and localized action on FARP mRNA and protein. 
Preferred therapies and drugs result in enhanced or normalized apoptosis of vascular endothelial 
cells in the arthritic joint, leading to a diminution or reversal of disease symptoms. 
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WtfATTSrT.ATMFDIS: 

1 . A method for the diagnosis and analysis of autoimmune disease or arftritides, in a 

patient, comprising: 

obtaining a patient sample containing mRNA; 

analyzing gene expression using tlie mRNA that results in a gene expression 
signature of that mRNA, wherein said gene expression signature comprises the identification and 
quantitation of gene expression from genes that have been identified as being differentially 

expressed in RA; and 

using that gene expression signature to diagnose or analyze the autoimmune disease 
or arthritide in said patient, wherein said gene expression of at least about 60% of said genes 
correlates with titiat of said gene signature. 

2. The method of Qaim I wherein said autoimmune disease or arthritides are selected from 
the group consisting of: Rheumatoid Arthritis, Lupus. Ankylosing Spondylitis, fibrositis. 
fibromyalgia, osteoarthritis. Gout, Juvenile Rheumatoid Arthritis, and an autoimmune disease 
caused by an infectious agent. 

3. The method of Qaim 1 wherein said autoimmune disease or arthritide is rheumatoid 
arthritis. 

4. The method of Qaim 1 wherein said patient is selected from the group consisting of. a 
human, a primate, a dog, a cat, a horse, and a sheep. 

5. The mettiod of Claim 1. wherein said analysis is selected from the group consisting of: an 
analysis of severity of the disease, an analysis of pain manifestation, an analysis of deformity, an 
analysis of treatment methods, and an analysis of treatment efficacy. 

6. The method of Claim 1 wherein said gene expression analysis involves at least about 10 
genes that are identified as differentially expressed in arthritis. 

7. The method of Claim 1 wherein said gene expression analysis involves at least about 50 
genes that are identified as differentially expressed in arthritis. 

8. The method of Claim 1 wherein said gene expression analysis involves at least about 100 
genes that are identified as differentially expressed in arthritis, 

9. The method of ciaim 1, wherein said genes identified are expressed at least about 1.5 fold 
higher or lower than normal. 

10. The method of Claim 1, wherein said genes identified are expressed at least about 2 fold 
higher or lower than normal. 

11. The method of Claim 1, wherein said genes identified are expressed at least about 3 fold 
higher or lower than normal. 

12. The method of Claim I, wherein said genes are selected from the group consisting of the 
385 genes or ESTs in Table 1 (SEQ ID NOS: 1-385), homologs. or variant thereof. 
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13. The method of Claim 1 , wherein said genes are selected from the group consisting of: 
the genes in cluster A. 

14. The method of Qaim 13, wherein the genes in cluster A are down-regulated (SEQ ID 
NOS:l-37) at least about 2 fold. 

15. The method of Claim 1, wherein said genes are selected from the group consisting of: the 
genes in cluster B. 

16. The method of Claim 15, wherein the genes in cluster B are up-regulated (SEQ ID N0S:1- 
37) at least about 2 fold only in late or severe disease. 

17. The method of Claim 1 , wherein said genes are selected from the group consisting of: 
the genes in cluster C. 

18. The method of Qaim 17, wherein the genes in cluster C are up-regulated (SEQ ID N0S:1- 
37) at least about 2 fold only in early or mild disease. 

19. The method of Claim 1. wherein said genes are selected from the group consisting of: 
the genes in cluster D. 

20. The method of Qaim 19, wherein the genes in cluster D are up-regulated (SEQ ID NOSil- 
37) at least about 2 fold in early or mild disease and more in late or severe disease. 

21 . The method of Claim 1, wherein said genes are selected from the group consisting of: 
the genes in cluster E. 

22. The method of Claim 21, wherein the genes in cluster E are up-regulated (SEQ ID N0S:1- 
37) at least about 2 fold in both early or mile and late or severe disease. 

23. The method of Qaim 1 wherein said differentially expressed genes are the 385 genes 
identified as SEQ ID NOS:l-385. 

24. The method of Claim 1 wherein if the genes in clusters B or D are upregulated, the disease 
is diagnosed as severe. 

25. The method of Claim 1 wherein if the genes in cluster A are upregulated, the disease is 
diagnosed as moderate to low-grade. 

26. The metliod of Claim 1, wherein said gene expression of at least about 70% of said genes 
correlates with that of said gene signature. 

27. The method of Claim 1 , wherein said gene expression of at least about 80% of said genes 
correlates with that of said gene signature. 

28. The method of Claim 1, wherein said gene expression of at least about 90% of said genes 
correlates with that of said gene signature. 

29. The method of Claim 1, wherein said gene expression of at least about 95% of said genes 
coixelates with that of said gene signature. 

30. A method for the treatment of RA comprising: 

down-regulating at least one of the genes identified in clusters B througih D. 



wo 03/072827 PCT/US02/3S433 

31. The method of Claim 30 wherein said down-regulation is by adding antisense 
oligonucleotides specific for the gene that is being down-regulated. 

32. The method of Claim 30 wherein said down-regulation is by adding or expressing an 
repressor of the gene that is being down-regulated. 

33 . A method for the treatment of RA comprising: 
up-regulating at least one of the genes in cluster A. 

34. The method of claim 33 wherein said up-regulation is by adding or expressing a 
transcriptional activator of the gene that is being up-regulated. 

35 . The method of claim 33 wherein said up-regulation is by adding a vector that expresses the 
protein encoded by the gene that is being up-regulated. 

36. A method for the identification of genes for targeting in the treatment of rheumatoid 
arthritis in a mammal other flian a mouse, comprising: 

identifying homologs of SEQm NOS:l-385. 

37. A method for the diagnosis of rheumatoid arthritis in a mammal, comprising: 

obtaining a tissue or fluid sample from a diseased patient; , 
isolating mRNA from said sample; 

using the isolated mRNA to analyze the gene expression of at least about 40 genes, 
selected from the group consisting of SEQ ID NOS:l-385 or a homolog thereof, obtaining a 
fingerprint of the patient's gene expression; 

identifying whether at least about 60% of said fingerprint is at least about 2 fold 

differentially expressed from that of a normal patient. 

38. An array or a genechip. specific for rheumatoid arthritis, comprising at least 10 of the genes 
selected from the group consisting of SEQ ID NOS:l-385 or homologs thereof 

39. The array or genechip of Claim 38, comprising at least 40 of the genes selected from the 
group consisting of SEQ ID NOS: 1-385 or homologs thereof. 

40. The array or genechip of Claim 38, comprising at least 50 of the genes selected from the 
group consisting of SEQ ID NOS: 1-385 or homologs tiiereof. 

41. The array or genechip of Claim 38, comprising at least 75 of the genes selected from the 
group consisting of SEQ ID NOS: 1-385 or homologs thereof 

42. The array or genechip of Claim 38, comprising at least 100 of the genes selected tmm the 
group consisting of SEQ ID NOS:l-385 or homologs thereof 

43. An array or a genechip, specific for rheumatoid arthritis consisting essentially of, at least 10 
of the genes selected from the group consisting of SEQ ID NOS:l-385 or homologs thereof. 

44. The array or genechip of Claim 43, consisting essentially of at least 40 of the genes selected 
from the group consisting of SEQ ID NOS: 1-385. 

45. The array or genechip of Claim 43, consisting essentially of SEQ TD NOS:l-385. 
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46. The array or genechip of Claim 38, wherein said genes allow for flie identification of the 
severity of tlie disease. 

47. Tlie array or genechip of Claim 38, wherein said genes allow for the prognosis of the 
disease. 

48. The airay or genechip of Claim 38, wherein said genes allow for the diagnosis of the 
disease. 

49. The array or genechip of Claim 38, wherein said genes allow for the identification of the 
most efficacious treatment of the disease in a specific patient. 

50. A method for the diagnosis or analyses of autoimmune disease or rheumatoid arthritis, 

comprising 

obtaining mRNA from a patient; 

using the mRNA as a probe for the analysis of the array or genechip of Claim 38; 
comparing the results obtained with those of a normal patient. 

51. A method of screening the efficacy of a candidate drug in vitro for the treatment of 
collagen-induced arthritis comprising: 

identifying vascular endothelial cells expressing FARP mRNA and protein; 
introducmg a candidate drug to said endothelial cells; and 

evaluating whether said candidate drug causes enhanced or normalized apoptosis of 
vascular endothelial cells. 

52. A method of reducing the symptoms associated with collagen-induced arthritis conaprising: 

identifying a subject suffering from collagen-induced arthritis; and 
administering a compound effective to deplete at least one of the group of FARP 
mRNA, FARP protein, FARP receptor binding, and FARP activity. 

53. The method of claim 52, wherein said compound is an anti-FARP antibody. 

54. The methof of claim 53, wherein said antibody interferes with binding of FARP to a FARP 
receptor. 
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